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Welcome  to  the  11th  Annual  Review  of 
Progress  in  Applied  Computational 
Electromagnetics . 

While  this  year's  conference  retains  the 
flavor  of  past  meetings,  some  changes  have 
been  made.  In  particular,  more  of  the 
sessions  have  been  organized  around  a 
particular  theme,  with  more  invited  papers 
than  in  previous  years.  The  individuals 
responsible  for  these  sessions  are  indicated 
in  the  agenda  as  Session  Organizers,  and 
their  efforts  are  very  much  appreciated. 

Special  recognition  should  be  given  to 
several  individuals  who  contributed 
significantly  to  the  conference.  Paul 
Goggans  provided  the  advertising  for  the 
conference.  Rob  Lee  once  again  did  an 
excellent  job  of  organizing  the  Short 
Courses.  Richard  Gordon  received  all  the 
papers  and  put  them  together  to  produce  these 
Proceedings.  And  of  course  Richard  and  Pat 
Adler  must  be  recognized  for  arranging  for 
the  printing  of  the  Proceedings,  for 
providing  access  to  the  NPGS  Facilities,  and 
for  all  their  other  contributions  to  the 
conference. 


Please  enjoy  your  time  in  Monterey  as 
much  as  possible,  whether  listening  to  a 
paper  being  presented,  catching  up  on  the 
latest  "gossip"  in  the  hallways,  or  enjoying 
the  sights  of  Monterey. 


Raymond  Luebbers 
Technical  Program  Chairman 
1995  ACES  Conference 


The  Link  Between  Developers  and  Users  of  Computational  Electromagnetics  Techniques 
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ACES  PRESIDENT  S  STATEMENT 


It's  nice  to  be  here  in  California  in  March,  especially  since  we  weren't  entirely  certain  that  there  was  going 
to  be  a  California  in  March.  This  state  seems  to  attract  more  than  its  fair  share  of  calamities,  ranging  from 
floods,  fires,  mud  slides,  and  OJ-things. 

In  any  case,  we  re  back  at  ACES’95,  the  Eleventh  Annual  Review  of  Progress  in  Applied  Computational 
Electromagnetics,  which  reverts  to  the  Naval  Postgraduate  School,  our  long-time  home. 

Ray  Luebbers  and  his  committee  have  put  together  an  excellent  program,  one  which  embraces  ‘low 
frequency'  CEM  to  'high  frequency  'CEM.  The  subject  matter  to  be  discussed  ranges  from  the  traditional 
scattering  and  radiation  problems,  so  typical  of  ACES  past,  to  optimization  techniques  in  applied 
electromagnetics.  This  area  has  been  of  interest  to  designers  of  electrical  machines,  but  we  see  that  It  Is 
being  applied  to  hyperthermia  and  superconducting  accelerator  magnets. 

I  am  pleased  to  see  ACES  embrace  all  aspects  of  computational  electromagnetics,  for  there  is  no  single 
'natural'  milieu  of  CEM  for  ACES.  The  problems  of  CEM  are  many  and  varied.  If  one  has  an  interest  in  solving 
Maxwell's  equations,  he  should  be  at  home  at  any  frequency,  in  any  environment. 

You  will  note  that  Ken  Siarkiewicz,  a  long-time  dedicated  supporter  of  ACES,  has  put  together  an  interesting 
session  entitled  'Research  and  Engineering  Framework  for  CEM'.  Ken  then  did  himself  proud  by  releasing 
$10,000  of  MM  ACE  (millimeter- wave,  microwave  advanced  computational  environment)  funding  so  that 
ACK  could  properly  support  this  session.  This  tells  me  two  things:  first,  that  Ken  is  a  pretty  good  man  to  have 
on  your  side,  and  second,  that  ACES  is  increasingly  being  viewed  as  the  preeminent  vehicle  to  expose 
issues  in  computational  electromagnetics. 

ACES  is  reaching  this  stature  because  of  the  dedication  of  its  members  especially  those  who  volunteer  to 
do  its  work.  When  you  publish  next,  think  about  the  ACES  Journal  or  Newsletter  as  the  publication  of  choice. 
When  you  wish  to  give  your  profession  extracurricular  support,  consider  working  for  ACES.  A  lot  of  people 
have,  and  a  lot  more  will. 

Enjoy  the  Eleventh  Annual  Review. 

Harold  A.  Sabbagh 
Sabbagh  Associates,  Inc. 

4635  Morningside  Drive 
Bloomington,  IN  47408 
(812)  339-8273 
(812)  339-8292  FAX 
email;has@sabbagh.com 
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ACES  1995  SHORT  COURSES 

MONDAY  MARCH  20  FULL-DAY  COURSE 

0830-1630  "Finite  Elements  for  Electromagnetics' 

by  John  Brauer,  MacNeal  Schwendler  Corporation 

0830-1630  "GEMACS  From  A-Z"  by  Buddy  Coffey,  Advanced  EM. 

0830-1630  "Physical  Wavelets"  by  Gerald  Kaiser,  University  of  Massachusetts  at  Lowell. 

HALF-DAY  COURSE 

1300-1630  "The  Multiple  Multipole  Program  (MMP):  Theory,  Practical  Use  and  Latest  Features" 
by  Pascal  Leuchtmann,  Swiss  Federal  Institute  of  Technology 

1300-1630  "Verification  and  Validation  of  Computational  Software"  by  E.K.  Miller,  Ohio  University. 

SATURDAY  MARCH  25  FULL-DAY  COURSE 

0830-1630  “Using  Mathematical  Software  for  Computational  Electromagnetics" 
by  Jovan  Lebaric,  Naval  Postgraduate  School. 

0830-1630  "Wire  Antenna  Modeling  Using  NEC"  by  Dick  Adler,  Nava!  Postgraduate  School,  Jim 
Breakall,  Penn  State  University,  and  Gerry  Burke,  Lawrence  Livermore  National  Lab. 

0830-1630  "FDTD,  Generalized  FDTD  and  FVTD  Techniques  in  Solving  Maxwell's  Equations" 
by  Kane  Yee,  Lockheed. 


xvl 


FINAL  AGENDA 

The  Eleventh  Annual  Review  of  Progress  in  Applied  Computational  Electromagnetics 

NAVAL  POSTGRADUATE  SCHOOL 
20  -  25  MARCH,  1995 

Raymond  Luebbers,  Technical  Program  Chairman 
Richard  Gordon,  Proceedings  Editor 
Robert  Lee,  Short  Course  Chairman 
Paul  Goggans,  Publicity  Chairman 
Richard  Adler,  Conference  Facilitator 


MONDAY  20  MARCH 

0830-1630  SHORT  COURSE  (FULL-DAY) 

"Finite  Elements  for  Electromagnetics" 

0830-1630  SHORT  COURSE  (FULL-DAY) 

"GEMACS  from  A-Z" 

0830-1630  SHORT  COURSE  (FULL-DAY) 

"Physical  Wavelets" 

1300-1630  SHORT  COURSE  (HALF-DAY) 

"The  Multiple  Multipole  Program  (MMP):  Theory,  Practical  Use 
and  Latest  Features" 

1300-1630  SHORT  COURSE  (HALF-DAY) 

"Verification  and  Validation  of  Computational  Software" 

0800-2030  CONFERENCE  REGISTRATION 

TUESDAY  21  MARCH 

0700  CONFERENCE  REGISTRATION 

0700-0800  CONTINENTAL  BREAKFAST 

0800  WELCOME  Raymond  Luebbers 

SESSION  1 :  SCATTERING  (parallel  with  Sessions  2  and  3) 

CHAIRS:  V.  CABLE,  E.  MILLER 

0840  "A  CGFFT  Method  Applied  to  the  Scatttering  from  Finite  Size 
Microstrip  Antenna" 

0900  "Analysis  of  Scattering  by  Cluster  of  Nonspherical 
Particles  Based  on  Complete  Mathematic  Models" 

0920  "Analytic  Solution  for  Calculating  the  Radar  Cross-Section  and  Related 
Parameters  of  a  Conducting  Right  Circular  Cylinder  Surrounded  by 
Multiple  Layers  of  Lossy  Dielectrics" 

0940  "RCS  of  High  Permittivity  Cubes  Computed  with  the  TLM 
Method" 

1000  BREAK 

1020  "Scattering  Analysis  of  Antenna  Installations/Panels  on  a  Curved 
Surface  Using  Uniform  Field  Integration  Method" 

1040  "Code  Validation  of  Aircraft  Scattering  Parameters  using  IR 
Thermograms" 


102  Glasgow 

John  Brauer,  MacNeal-Schwendler  Corporation 
122  Ingersoll 

Buddy  Coffey,  Advanced  EM 
101  A  Spanagel 

Gerald  Kaiser,  Univ.  of  Massachusetts  at  Lowell 
325  Ingersoll 

Pascal  Leuchtmann,  Swiss  Federal  Institute 
of  Technology 

323  Ingersoll 

E.  K.  Miller,  Ohio  University 

103  Glasgow  Hall 


103  Glasgow  Hall 


102  Glasgow  Hall 
122  Ingersoll  Hail 


A.  McCowen 


Y.A.  Eremin,  N.W.  Orlov  and 
V.l.  Rozenberg 

G.W,  Jarriel,  Jr.,  M.  E,  Baginski, 
and  Lloyd  Riggs 


C.  Eswarappa  and  W.J.R.  Hoefer 


J.J.  Kim  and  O.B.  Kesler 


J.  Norgard,  R.  Sega,  M.  Seifert, 
T.  Blocher  and  A.  Pesta 
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TUESDAY  21  MARCH 


SESSION  1:  SCATTERING  (parallel  with  Sessions  2  and  3)  (CONT)  122  Ingersoll  Hall 


1100 

"A  New  Method  for  Solving  Scattering  Problems  with  Conducting  Media  in 
the  Time  Domain" 

M.  Schinke  and  K.  ReiB 

1120 

"Experience  and  Experiments  at  Cray  Research  with  JUNCTlON-2" 

J.A.  Crow  and  Q.M.  Sheikh 

1140 

"Quantitative  Methods  for  Measuring  and  Improving  the 

Performance  of  Electromagnetic  Scattering  Codes" 

J.P.  Meyers,  A.  J.  Terzuoli,  Jr., 
and  G.C.  Gerace 

LUNCH 

SESSION  2:  LOW  FREQUENCY  (parallel  with  Sessions  1  and  3) 

CHAIRS:  K.  KUNZ,  H.  SABBAGH 

117  Spanagel  Hall 

0840 

"Numerical  Modelling  of  EMC  in  Underground  Power  Cable 

Systems  with  the  Hybrid  FE-BE  Method" 

J.  Shen  and  A.  Kost 

0900 

"New  Contribution  to  the  Study  of  Fault  Currents  Distribution  in 
the  Ground  Systems" 

H.O.  Brodskyn,  M.H.  Giaroila,  J.R. 
Cardoso,  N.M.  Abe  and  A.  Passaro 

0920 

"On  the  Oscillatory  Phenomena  of  Eddy  Currents  Along 
the  A,  V-'T  Inferface" 

Z.  Cheng,  Q.  Hu,  S.  Gao,  Z.  Liu, 

M.  Wu  and  C.  Ye 

0940 

"A  New  MMP'Code  for  Static  Field  Computation" 

M.  Gnos  and  P.  Leuchtmann 

1000 

BREAK 

1020 

"Molten  Aluminum  Flow  Induced  by  High  Magnetic  Fields" 

W.P.  Wheless,  Jr.  and  C.S.  Wheless 

1040 

"The  Electrostatic  Characterization  of  a  N-Element  Planar 

Array  Using  the  Singularity  Expansion  Method" 

J.E.  Mooney  and  L.  Riggs 

1100 

"A  Volume-Integral  Code  for  Electromagnetic  Nondestructive 

Evaluation" 

R.K.  Murphy,  H.A.  Sabbagh, 

J.C.  Treece  and  L.  W.  Woo 

LUNCH 

SESSION  3:  RESEARCH  AND  ENGINEERING  FRAMEWORK  FOR  CEM  (parallel  with  Sessions  1  and  2)  102  Glasgow  Hall 
ORGANIZER:  K,  SIARKIEWICZ 

0840 

"Research  and  Engineering  Framework  (REF)  for  Computational 
Electromagnetics”  (Invited) 

B.  Hantman,  K.  Siarkiewicz, 

J.  Labelle  and  R.  Jackson 

0900 

"Research  &  Engineering  Framework  (REF)  Data  Dictionary 

Specification  for  Computational  Electromagnetics"  (Invited) 

J.A.  Evans 

0920 

"DT_NURBS  -  A  Geometry  Engine  for  Integration  of  the 

MMACE  Data"  (Invited) 

B.  Ames  and  C.  Whitcomb 

0940 

"Standardized  Grid  Generation  for  the  Research  and 

Engineering  Framework"  (Invited) 

L.W.  Woo,  H.  A.  Sabbagh, 

J,  LaBelle  and  B.  Hantman 

1000 

BREAK 

1020 

"Visualization  and  Standards"  (Invited) 

J.  Cugini 

1040 

"A  Visualization  Toolkit  for  Computational  Electromagnetics”  (Invited) 

B.  Joseph 

1100 

"MMACE  -  Lessons  for  the  Development  of  a  CEM  Computational 
Environment"  (Invited) 

R.G.  Hicks  and  K.R.  Siarkiewicz 

LUNCH 

1200 

BOARD  OF  DIRECTORS  MEETING 

Terrace  Room,  Herrmann  Hall 
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1330-1  730  VENDOR  BOOTHS  AND  WINE  AND  CHEESE  BUFFET 


Barbara  McNitt  Ballroom,  Herrmann  Hall 


1 800  HAPPY  HOUR  {NO  HOST) 

1 900  AWARDS  BANQUET 

1 330-1 530  SHORT  COURSE  {PARTIAL  DAY,  NO  FEE)  Bob  Bevensee 
"Time  Series  Analyses  of  Equity  Stock  Prices 
and  a  Profitable  Investment  Strategy" 

1330-1530  SESSION  4:  INTERACTIVE  TECHNICAL  SESSION 


Barbara  McNitt  Ballroom,  Herrmann  Hall 
Barbara  McNitt  Ballroom,  Herrmann  Hall 
102  Glasgow  Halt 

Barbara  McNitt  Ballroom,  Herrmann  Hall 


SESSION  4A:  EM  THEORY  I 


Barbara  McNitt  Ballroom,  Herrmann  Hall 


"Pulse  Basis  Function  Implementation  of  the  Radiation 
Condition  Integral  Equations" 

"Finite  Difference  Solutions  of  Geometrical  Optics  and 
Some  Related  Nonlinear  PDEs  Approximating  High 
Frequency  Helmholtz  Equation" 

"Conversion  of  Mechanical  Energy  to  Electromagnetic  Energy" 

"Block-Toeplitz-Structure-Based  Solution  Strategies  for  CEM  Problems" 

"The  Two-Dimensional  Finite  Integral  Technique  Combined 
with  the  Measured  Equation  of  Invariance  Applied  to 
Transverse  Electric  Open  Region  Scattering  Problems" 

"Artificial  Transparent  Boundaries  in  Computational  Quasioptics" 

"A  Statistical  Electromagnetics  (STEM)  Research  Initiation  Report" 


P.C.  Colby 

E.  Fatemi,  B.  Engquist  and  S.  Osher 

R.M.  Bevensee 

V. l.  Ivakhnenko  and  E.E.  Tyrtyshnikov 
G.K.  Gothard  and  S.M.  Rao 

A.V.  Popov 

W. P.  Wheless,  Jr.,  C.B.  Wallace  and  W.D.  Prather 


"Optimization  of  Aperiodic  Conducting  Grids" 

"Accurate  MOM  Scattering  Calculations  Using  Massively  Parallel 
Computation" 

"A  New  Angle  on  a  Low  Cost  Ground  Screen  for  Model  Testing  in  the 
Undergraduate  Antennas  Laboratory  (Looking  at  Near  Vertical 
Incidence  Sky  waves  {NVIS)  for  a  Coast  Guard  Patrol  Boat)" 

"Efficient  Extraction  of  the  Near-Field  from  CGFFT  Methods  Applied  to 
Scatterers  in  the  Resonance  Region" 

SESSION  4B:  VISUALIZATION  &  INTERFACES 


R.L.  Haupt 

L.D.  Vann  and  J.S.  Bagby 


M.E,  McKaughan,  W.M. 
Randall  and  B.  Nutter 


A.  McCowen 


Barbara  McNitt  Ballroom,  Herrmann  Hall 


"Computer  Code  for  Field  Calculation  and  Visualization  in  Quasioptics" 
"Dosimetry  in  a  Voxel  Model  of  the  Head" 

"A  Graphical  User  Interface  for  the  NEC-BSC" 

"MF  Communication  and  Broadcast  Prediction  System" 


Y.V.  Kopylov 
P.J,  Oimbylow 

L. W.  Henderson  and  R.J.  Marhefka 

M. J.  Packer  and  A.P.  Tsitsopoulos 


"A  Finite  Difference  Time  Domain  Visualization  Tool 
for  Microsoft  Windows™" 


A.  Z.  Elsherbeni, 

C.D.  Taylor,  Jr.  and  C.  E.  Smith 


SESSION  4C:  VALIDATION 


Barbara  McNitt  Ballroom,  Herrmann  Hall 


"Transformable  Scale  Aircraft-Like  Model  for  the  Validation 
of  Computational  Electromagnetic  Models  and  Algorithms:  Initial 
Configuration  and  Results" 

"Measurement  Study  for  Validation  of  Electromagnetic  Scattering  Codes 
on  a  Complex  3D  Target” 

"Validation  Using  a  Moment  Method  Approach  with  Exact  Object 
Representation” 

"IR  Measurements  for  Validating  EM  Analysis  Tools" 
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D.R.  Pfiug  and  D.  Warren 


T.  Kienberger  and  D.  Jurgens 

J.A.  Larsson,  S.  Ljung  and 
B.Wahlgren 

M.  Seifert,  T.  Blocher  and  A.  Pesta 


SESSION  4D:  EMI/EMC/EMP 


Barbara  McNitt  Ballroom,  Herrmann  Hall 


’’Analysis  of  Electromagnetic  Interference  at  an  Ocean  Observation  Post"  L  Bai  and  J.F.  Dai 


"Enforcing  Correlation  on  Statistically  Generated  EM  Cable  Drivers" 


R.  Holland  and  R.  St.  John 


"Analysis  of  Different  Contributions  to  the  Coupling  Between  Reflector 
Antennas  on  a  Satellite" 


C.  Park  and  P.  Ramanujam 


"Simple  Radiation  Models  in  Lieu  of  EMC  Radiated  Emissions  Testing" 

WEDNESDAY  MORNING  22  MARCH 


0730  CONTINENTAL  BREAKFAST 


0800  ACES  BUSINESS  MEETING 


President  Hal  Sabbagh 


102  Glasgow  Hall 


SESSION  5:  OPTIMIZATION  TECHNIQUES  IN  APPLIED  ELECTROMAGNETICS  (parallel  with  Sessions  6  and  7)  361  Ingersoll  Hall 
ORGANIZER:  O.A.  MOHAMMED 


0840  "An  Optimization  Approach  to  Reduce  the  Discretization  Error  in  Finite 
Element  Explicit  Solution  Sceme"  (Invited) 


M.  Feliziani,  E.  Latini,  F.  Maradei 


0900  "Analysis  and  Design  of  a  Reentrant  Resonant  Cavity  Applicator  for 
Radio  Frequency  Hyperthermia  System"  (Invited) 


Y.  Kanai,  T.  Tsukamoto,  K.  Toyama, 
T.  Kashiwa,  Y.  Saitoh  and 
M.  Miyakawa 


0920  "Analysis  of  Loaded  Cavities  Using  the  Constitutive  Error  Approach" 
(Invited) 


R.  Albanese,  R,  Fresa,  R.  Martone 
and  G.  Rubinacci 


0940  "The  Design  of  Electromagnetic  Devices  using  Knowledge  Based  Systems  D.A.  Lowther,  D.  N.  Dyck  and 
and  Sensitivity  Information"  (Invited)  R.  Rong 


1000  BREAK 


1020  "A  Computer  Program  for  the  Design  of  Superconducting  Accelerator 
Magnets"  (Invited) 


S,  Russenschuck 


1040  "Application  of  Optimization  to  the  Design  of  Electromechanical  Devices" 
(Invited) 


J.K.  Sykulski  and  Y.B.  Cheng 


1100  "Genetic  Algorithms  for  the  Optimal  Design  of  Electromagnetic  Devices" 
(Invited) 


O.A. Mohammed,  G.F.  Uler 


1140  "Linear  Constraints  -  Gradient  Technique  for  the  Inverse  Problem  of 
Design  Optimization"  (Invited) 


A. A.  Arakadan  and 
S,  Subramaniam-Sivanesan 


SESSION  6:  COMPUTATIONAL  ELECTROMAGNETICS  APPLIED  TO  SHIP  DESIGN  (parallel  with  Sessions  5  and  7)  102  Glasgow  Hall 
ORGANIZERS:  J,  NEWCOMB  AND  J.  LOGAN 


0840  "The  Naval  Sea  Systems  Command 

Electromagnetic  Engineering  Program"  (Invited) 

0900  "EM  Engineering  System  Architecture"  (Invited) 

0920  "EM  Engineering  Ray  Tracing  and  Casting  Model 
RTC"  (Invited) 

0940  "Ship  Transition-Frequency  EM  Environment  Analysis 
Requirements"  (Invited) 

1000  BREAK 


D.  Cebulski,  N.  Baron  and  J.  Eadie 


1020  "Finite  Volume  Time  Domain  Analysis  of  Ship  Topside  EM 
Environment  Features"  (Invited) 


B.  Hall,  A.  Mohammadian,  C.  Rowell 
and  V.  Shankar 


1040  "EM  Engineering  Ship  End-To-End  Application"  (Invited) 


L.  R.  Carlson,  C.  F.  Juster,  G.  R.  Allen 
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SESSION  6:  (CONT)  COMPUTATIONAL  ELECTROMAGNETICS  APPLIED  TO  SHIP  DESIGN  (parallel  with  Sessions  5  and  7}  102  Glasgow 

1100  "EM  Engineering  Applied  to  Patrol  Craft  (PC-1)"  (Invited)  D.  Tam,  J.  McGee,  C.  Azu  and  M.  Soyka 

1120  "Shipboard  Antenna  Pattern  Visualization  and  Analysis"  (Invited)  L.C.  Russell,  J.C.  Logan, 

J.W.  Rockway  and  D.F.  Schwartz 

LUNCH 

SESSION  7:  FINITE  DIFFERENCE  TIME  DOMAIN  (parallel  with  Sessions  5  and  6)  122  Ingersoll  Hall 

ORGANIZER:  J.  BEGGS 

0840  "Computational  Analysis  of  Radiation  from  an  Elliptical  Shaped  S.  A.  Biocher,  E.  A.  Baca  and 

End  Radiator"  (Invited)  J-  H.  Beggs 

0900  "A  Time  Domain  Harmonic  Oscillator  Model  for  an  FDTD  Treatment  of  K.  S.  Kunz 

Lossy  Dielectrics"  (Invited) 

0920  "FDTD  Modeling  of  Electromagnetic  Wave  Interactions  with  Composite  J.G.  Maloney  and  B.L.  Shirley 

Random  Sheets”  (Invited) 

0940  "An  Improved  Near  to  Far  Field  FDTD  Algorithm"  (Invited)  K.  S.  Kunz 

1000  BREAK 

1020  "Unstructured  Finite-Volume  Modeling  in  Computational  Electromagnetics"  D.J.  Riley  and  C.D.  Turner 

(Invited) 

1040  "Scattering  from  Coated  Targets  Using  a  Frequency-  C.W.  Penney,  R.J.  Luebbers  and 

Dependent,  Surface  Impedance  Boundary  Condition  in  FDTD"  J.W.  Schuster 

1100  "Hybrid  Finite  Difference  Time  Domain  and  Finite  Volume  Time  Domain  in  K.S.  Yee  and  J.S.  Chen 

Solving  Maxwell's  Equations"  (Invited) 

1120  "Reducing  the  Number  of  Time  Steps  Needed  for  FDTD  Antenna  and  R.  Luebbers  and  H.S.  Langdon 

Microstrip  Calculations"  (Invited) 

1 140  "Numerical  Simulations  of  Light  Bullets,  Using  the  Full  Vector,  Time  P.  Goorjian  and  Y.  Silberberg 

Dependent,  Nonlinear  Maxwell  Equations" 

LUNCH 


SESSION  8;  BERENGER’S  BOUNDARY  CONDITION  (parallel  with  Sessions  10  and  11)102  Glasgow  Hall 
ORGANIZER:  J.  FANG 


1320 

"Ultrawideband  Termination  of  Waveguilding  and  Multilayer 

Structures  for  FD-TD  Simulations  in  2-D  and  3-D"  (Invited) 

C.E.  Reuter,  R.M.  Joseph,  E.T. 
Thiele,  D.S.  Katz  and  A.  Taflove 

1340 

"A  3-D  Perfectly  Matched  Medium  by  Coordinate  Stretching 
and  Its  Absorption  of  Static  Fields"  (Invited) 

W.C.  Chew,  W.H.  Weedon 
and  A.  Sezginer 

1400 

"Perfectly  Matched  Anisotropic  Absorbers  for  Finite  Element 
Applications  in  Electromagnetics" 

D.M.  Kingsland,  Z.S.  Sacks 
and  J.F.  Lee 

1420 

"Modification  of  Berenger's  Perfect  Matched  Layer  for  the 

Absorption  of  Electromagnetic  Waves  in  Layered  Media" 

M.  Gribbons,  S.K.  Lee 
and  A.C.  Cangellaris 

1440 

"Performance  of  the  Perfectly  Matched  Layer  in  Modeling 

Wave  Propagation  in  Microwave  and  Digital  Circuit  Interconnects" 

Z.  Wu  and  J.  Fang 

1500  BREAK 


SESSION  9:  TIME  DOMAIN/FDTD  (parallel  with  SesstonslO,  11  and  12} 
CHAIRS:  L.  LONG,  J.  MALONEY 


102  Glasgow  Hall 


1520  "A  FVTD  Algorithm  for  Maxwell’s  Equations  on  V,  Ahuja  and  L.N.  Long 

Massively  Parallel  Machines" 

1540  "The  Piecewise  Linear  Recursive  Convolution  Method  for  D.F.  Kelley  and  R.  J.  Luebbers 

Incorporating  Dispersive  Media  into  FDTD" 

1600  "Combining  Different  Coordinate  Systems  in  the  Time  M.  Mrozowski,  M.  Okoniewski, 

Domain  Finite  Difference  Method"  M.A.  Stuchly  and  S.S.  Stuchly 

1620  "Time  Domain  Response  of  Simulated  2D  Composite  Scatterers"  A. 2.  Elsherbeni  and  P.  Goggans 

1640  "An  Object-Oriented  Approach  to  Writing  M.  Zimmerman  and  P.  Mallasch 

Computational  Electromagnetics  Codes" 

SESSION  10:  FAST  ALGORITHMS  FOR  COMPUTATIONAL  ELECTROMAGNETICS  (parallel  with  Sessions  8,  9,  1 1.  and  12) 

ORGANIZERS:  E.  MICHIELSSEN  AND  W.  CHEW  122  Ingersoll  Hall 

1320  "On  the  Use  of  Wavelet-L'ke  Basis  Functions  in  the  Finite  Element  R.K.  Gordon 

Analysis  of  Elliptic  Problems"  (Invited) 

1340  "Fast  Wavelet  Algorithm  (FWA)  for  Moment  Method  Analysis  of  K.  Sabetfakhri  and  L.P.B.  Katehi 

Electromagnetic  Problems"  (invited) 

1400  "Fast  Far  Field  Approximation  for  Calculating  the  RCS  of  Large  Objects"  C.C.  Lu  and  W.C.  Chew 

(Invited) 

1420  "The  Parameter  Estimation  Technique  (PET):  Speeding  Up  Dense  Matrix  C.  Hafner  and  J.  Frohlich 

Methods"  (Invited) 

1440  "A  Novel  Scheme  for  Massively  Parallel  Solution  of  Maxwell's  Equations  M.A.  Jensen,  Y.  Rahmat-Samii 

using  FDTD"  (Invited)  and  A.  Fijany 

1500  BREAK 

1520  "Reduction  of  the  Filling  Time  of  Method  of  Moments  Matrices"  (Invited)  G.  Vecchi,  P.  Pirinolt,  L.  Matekovits  and  M. 

Orefice 

1540  "The  Fast  Multipole  Method  for  Large  2d  Scatterers"  (Invited)  L.R.  Hamilton,  J.J.  Ottusch, 

M.A.  Stalzer,  R,S.  Turley, 

J.L,  Visher  and  S.M.  Wandzura 

1600  "A  Multilevel  Matrix  Decomposition  Algorithm  for  Analyzing  Scattering  E.  Michielssen  and  A.  Boag 

from  Large  Structures"  (Invited) 

1620  ”A  3D  Fast  Multipole  Method  for  Electromagnetics  with  Multiple  Levels"  B.  Dembart  and  E.  Yip 

(Invited) 

1640  "Fast  Multipole  Method  Solution  of  Combined  Field  Integral  Equaltion"  J.M.  Song  and  W.  C.  Chew 

SESSION  1 1 :  MICROWAVE  AND  GUIDED  WAVE  (parallel  with  Sessions  8  and  10)  361  Ingersoll  Hall 

CHAIRS:  P.  GOGGANS,  A.  TERZUOLI 

1320  "Computer-Simulation  of  Isotropic,  Two-Dimensional  Guided-Wave  R.A.  Specials 

Propagation" 

1340  "Analysis  of  Ultra-Short  Pulse  Propagation  on  Uniform  and  Tapered  Printed  R.A.O.  Veliz  and  J.R.  Souza 

Transmission  Lines" 

1400  "Wave-Field  Patterns  on  Electrically  Large  Networks"  R.A.  Specials 

1420  "Scattering  Characteristics  of  Dissimilar  Waveguide  Slot  Couplers"  A.  Singh  and  K.S.  Christopher 

1440  "An  Alternative  Formulation  of  the  Tranverse  Resonance  Technique"  A.G.  Neto,  S.  Ariguel,  H.  Aubert, 

D,  Bajon  and  H,  Baudrand 

1500  BREAK 
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SESSION  12:  MOM  (parallel  with  Sessions  9  and  10} 

CHAIRS:  A.  PETERSON,  R.  ZIOLKOWSKI 

361  Ingersoll  Hall 

1520 

"Moment  Method  Analysis  of  Dielectric  Covered  Radiating  Slots  Using 
Alternative  Green's  Function  Approach" 

S.  Christopher,  V.V.S.  Prakash, 

A.K.  Singh  and  N.  Balakrishnan 

1540 

"Computation  of  E-field  Distribution  of  Low  Gain 

Antenna  on  Conducting  Body  of  Revolution" 

J.  Liu,  J.  Wang  and  Y.  Gao 

1600 

"An  Implementation  of  an  Exact  Scheme  for  Problem  Decomposition 

Via  the  Use  of  Aperture  Admittance" 

D.L.  Wilkes,  C-C.  Cha  and  T.  Krauss 

1620 

"Parallelization  of  the  Parametric  Patch  Moment  Method  Code" 

X.  Shen,  G.E.  Mortensen,  C.C.  Cha, 

G.  Cheng  and  G.  C.  Fox 

1640 

THURSI 

"A  Tool  Box  for  Parallelization  of  Moments  Method  Codes" 

3AY  MORNING  23  MARCH 

E.  Yip,  B.  Blakely,  L.  Johnson, 

D.  Jurgens  and  R.  Kochhar 

0730 

CONTINENTAL  BREAKFAST 

SESSION  13:  RECENT  DEVELOPMENTS  IN  FDTD  ANALYSIS  (parallel  with  Sessions  14  and  IS)  102  Glasgow  Hall 

ORGANIZERS:  M.  PIKET-MAY  AND  D.  KATZ 

0840 

"Simulation  of  Microwave  Circuits  by  FDTD  Method"  (Invited) 

C.N.  Kuo,  B.  Houshmand  and  T.  Itoh 

0900 

"Adaptation  of  FDTD  Techniques  to  Acoustic  Modeling"  (Invited) 

J.G.  Maloney  and  K.E.  Cummings 

0920 

"FDTD  Investigation  of  the  Antenna-Tissue  Interaction  for  Cellular  and 
Satellite  Systems"  (Invited) 

Y.  Rahmat-Samii  and  M.A.  Jensen 

0940 

"FDTD  Modeling  of  Ground-Penetrating  Radar  Antennas" 

BJ.  Zook 

1000 

BREAK 

1020 

"FDTD  Modeling  of  Ultrashort  Optical  Pulse  Interactions  with 

Nonresonant  and  Resonant  Materials  and  Structures"  (Invited) 

R.W.  Ziolkowski 

1040 

"Time  Domain  Analysis  of  Electromagnetic  Wave  Propagation  in  Nonlinear 
Dielectric  Slab" 

G,  Miano,  C.  Serpico, 

L.  Verolino  and  F.  Villone 

1100 

"An  Efficient  Sub-gridding  Algorithm  for  FDTD." 

D.T.  Shimizu.  M.  Okoniewski  and  M.M.  Stuchly 

1120 

"Using  the  Integral  Forms  of  Maxwell's  Equations  to  Modify  and  Improve 
the  FDTD  (2,4)  Scheme" 

M.F.  Hadi  and  M.  Piket-May 

1140 

"From  the  Berenger  PML  ABC  to  Micro-Lasers:  Recent  Advances  in  FD-TD 
Modeling  Techniques"  (Invited) 

A.  Taflove 

LUNCH 

SESSION  14:  PROPAGATION  (parallel  with  Sessions  13  and  151 

ORGANIZER:  K.  CHAMBERLIN 

361  Ingersoll  Hall 

0840 

"Terrain  and  Refractivity  Effects  in  a  Coastal  Environment: 

Results  from  the  VOCAR  Experiment" 

A.  Barrios 

0900 

"Capabilities  and  Limitations  Associated  With  Using  GTD  to  Model 
Propagation  Path  Loss  in  the  Presence  of  Irregular  Terrain" 

K.  Chamberlin 

0920 

"Comparison  of  Electromagnetic  Wave  Propagation  Computer 

Programs" 

S.A.  Fast  and  T.  H.  Koschmieder 

0940 

"A  Model  for  Estimating  Electromagnetic  Wave  Attenuation 
in  a  Forest  (EWAF)  Environment" 

C.  Welch,  C.  Lemak,  and  L.  Corrington 

1000 

BREAK 

1020 

"Validation  of  the  Radio  Physical  Optics  Propagation  Model" 

R.A.  Paulus 

xxiii 

THURSDAY  MORNING  23  MARCH 

SESSION  14:  PROPAGATION  (parallel  with  Sessions  13  and  15)  (CONT) 

1040  "VTRPE:  A  Variable  Terrain  Electromagnetic  Parabolic 
Equation  Model" 

1 100  "Estimating  Tropospheric  Refractivity  Fields  Using  a  Nonlinear 
Gauss-Markov  Procedure  and  the  PE  Model” 

1120  "Modeling  of  Radio  Wave  Ducting  Over  Regular  Boundary" 

LUNCH 

SESSION  15:  PARALLELIZATION  OF  EM  CODES  (parallel  with  Sessions  13  and  14) 
ORGANIZERS:  J.  VOLAKIS  AND  A.  CHATTERJEE 

0840  "Advances  in  Time-Domain  CEM  Using 

Massively  Parallel  Architectures"  (Invited) 

0900  "Parallel  Solutions  of  Maxwell's  Equations  on  the  Meiko  CS-2"  (Invited) 


0920  "Parallelization  of  the  CARLOS-3D  Method  of  Moments  Code"  (Invited) 


0940  "Parallel  Computing  for  Electromagnetism  at  ONERA"  (Invited) 
1000  BREAK 

1020  "The  Performance  of  the  Parallel  Solution  of  the  Quasi-Minimal 
Residua!  (QMR)  Method  on  2D  Mesh  Architectures"  (Invited) 

1040  "Advanced  Parallel  Solver  Techniques”  (Invited) 

1100  "Parallelized  FDTD  for  Antenna  Radiation  Pattern  Calculations" 


1 120  "Calculation  of  Electromagnetic  Fields  with  the  Multiple  Multipole 
Method  (MMP  Method)  on  Parallel  Computers" 

1 140  "Implementation  of  the  Finite-difference  Time-Domain  Method  on  Parallel 
Computers" 

LUNCH 

THURSDAY  AFTERNOON  23  MARCH 

SESSION  16:  EM  THEORY  II  (parallel  with  Sessions  18  and  20) 

CHAIRS:  K.  YEE,  R.  GORDON 

1320  "FDTD  Investigation  of  the  Ability  to  Increase  Electromagnetic 
Fields  Around  Head  Tumors" 

1 340  "FDTD  and  PMM  Based  Design  of  a  TEM  Horn  Antenna  with  Reduced 
Off-Boresight  Fields" 

1400  "Determination  of  the  Complex  Aperture  Distribution  of  a  Planar 
Spiral  Antenna  from  3D  Far-Field  Radiation  Pattern  Data" 

1420  "Analysis  of  Micro-Contamination  of  Silicon  Wafers  Based 
on  Discrete  Sources  Method  (DSM)" 

1440  "Analysis  of  Convergence  Properties  of  Projection 
Methods  for  Solving  CEM  Applications" 


361  Ingersoll  Hall 
F.J.  Ryan 

D.  Boyer  and  F.J.  Ryan 

I. P.  Zolotarev 

122  Ingersoll  Hall 

C.  Rowell,  V.  Shankar,  W.F.  Hall 
and  A.  Mohammadian 

N.  Madsen,  B.  Erne,  D.  Steich 
and  G.  Cook 

J. M.  Putnam,  D.D.  Car 
and  J.D  Kotuiski 

A.  de  La  Bourdonnaye,  A.  Cosnuau, 

X.  Ferriferes,  P.  Leca  and  F.  X.  Roux 

L.  Hamandi,  F.  Ozguner  and  R.  Lee 

A.S.  King 

Z.M.  Liu,  A.S,  Mohan,  T.  Aubrey  and  W.R. 
Belcher 

C.  Tudziers  and  H.  Singer 

R.S.  David  and  L.T.  Wille 


361  Ingersoll  Hall 

D.B.  Dunn,  A.J.  Terzuoli,  Jr, 

G.C.  Gerace  and  C.A.  Rappaport 

D.J.  Wolstenholme, 

A.J.  Terzuoli,  Jr,  and  G.  C.  Gerace 

M.  Kiuskens,  W.  Lippincott 
and  M.  Kragalott 

Y.A.  Eremin  and  N.W.  Orlov 

V.l.  Ivakhnenko,  A.V.  Kukk,  E.E. 
Tyrtyshnikov,  A.Y.  Yeremin 
and  N.L.  Zamarashkin 


XXIV 


SESSION  17:  ELECTROMAGNETIC  MODELING  TECHNIQUES  FOR  INTEGRATED  OPTICS  {parallel  with  Sessions  18,  19  and  20) 
ORGANIZER:  A.  CANGELLARIS  361  Ingersol!  Hall 


1520  "Analysis  and  Design  of  Guided-wave  Optica!  Devices  Using 
Finite-Difference  Time-Domain  Method"  (Invited) 

1540  "Vectorial  Analysis  of  Optical  Waveguides  by  the  Method  of 
Lines"{lnvited) 

1600  "Vector  Finite  Element  Analysis  of  Lossless  and  Lossy 
Dielectric  Waveguides"  (Invited) 

1620  "NL-FDTD  Modeling  of  Linear  and  Nonlinear  Corrugated 

Waveguiding  Systems  for  Integrated  Optics  Applications"  {Invited) 

1640  "Analysis  of  Coupled  Nonlinear  Optical  Waveguides  by  Matrix  Method" 


S.i.  Chaudhuri  and  S.T.  Chu 

R.  Pregla  and  W.  Pascher 

P.  Cheung  and  A.  Gopinath 

R.W.  Ziolkowski  and  J.B.  Judkins 

V.  Tripathi,  A.  Weisshaar  and  H.S.  Chang 


SESSION  18:  TOPICS  IN  FRACTAL  AND  WAVELET  ELECTRODYNAMICS  {parallel  with  Sessions  16,  17and  20)  102  Glasgow  Hall 
ORGANIZERS:  D.H.  WERNER  AND  P.L.  WERNER 


1 320  "An  Overview  of  Fractal  Electrodynamics  Research"  (Invited) 

1340  "Fractal  Arrays  and  Fractal  Radiation  Patterns"  (Invited) 

1400  "Wavelet  Transforms  and  Time/Time-scale  Analysis"  (Invited) 

1420  "Wavelet-based  Processing  to  Efficiently  Achieve  Broadband 
Monostatic  and/or  Passive  Cross-sensor  Processing"  (Invited) 

1440  "The  Intervallic  Wavelets  with  Application  in  the  Surface  Integral 
Equations"  (Invited) 

1 500  BREAK 

1520  "Radar  Cross  Section  Data  Reduction  Using  Wavelets" 

SESSION  19:  NEC  APPLICATIONS  (parallel  with  Session  17  and  20) 

ORGANIZER:  J.  BREAKALL 

1540  "Computationally  Efficient  and  Accurate 

Approximations  for  Impedance  Matrix  Elements  of 
NEC-Type  Method  of  Moments  Formulations" 

1600  "Development  of  the  Coupled-Resonator  Antenna  Principle 
A  Computer  Modeling  Case  History" 

1620  "Antenna  Design  &  Development  Using  NEC-WIN" 


D.H.  Werner 

P.L.  Werner,  D.H.  Werner  and  A.J.  Ferraro 
R.K.  Young  and  T.G.  Golsberry 
R.K.  Young  and  L.H.  Sibul 

G.W.  Pan  and  J.Y.  Du 


A.S.  Ali,  S.E.  Duval  and  R.L.  Haupt 

1 02  Glasgow  Hall 

D.H.  Werner,  S.E.  Metker 
and  J.A.  Huffman 


G.  Breed 


T.  A.  Erdley,  J.  J.  Shapiro, 

J.  S.  Young  and  J.K.  Breakall 


1640  "The  "Paint"  System  A  UTD/NEC  Hybrid  Package  for  Simulating  Antenna  J.S.  Young  and  J.K.  Breakall 
Patterns  Over  3-Dimensional  Irregular  Terrain" 

SESSION  20:  FEM  (parallel  with  Sessions  16,  17.  18,  and  19)  122  Ingersoll  Hall 

CHAIRS:  R.  BURKHOLDER,  J.  KARTY 


1320  “Numerically  Characterizing  Electromagnetic  Fields  Local  to  the  Edge  of  A.S.  Ali  and  C.  L.  Holloway 
a  Conducting  Strip  Using  a  Matched  Asymptotic  Technique  and  the  Finite 
Element  Method" 

1340  "An  Enhanced  "A  Posteriori"  Remeshing  Algorithm  for  Adaptive  Meshing  P.  Girdinio,  A.  Manella  and 
of  2D  Finite  Element  Problems"  Molinart 


1400  "Finite  Element  Analysis  of  Waveguides  Using  Edge-Based  Magnetic  J.  -F.  Lee,  G.  Lizalek  and  J.  Brauer 

Vector  Potential  and  Nodal-Based  Electric  Scalar  Potential" 

1420  "A  Scattering  Analysis  of  Laser  Beam  Wave  by  Groove  Pits  on  Optical  Y.  Miyazaki  and  K.  Tanaka 

Memory  Disk  by  Using  FEM  with  BEM" 

1440  "3D  Nodal-  and  Mixed-Based  Elements  for  Unbounded  A.  Nicolas,  L.  Nicolas  and  J.L.  Yao-bt 

Microwave  Problems" 


1500  BREAK 


XXV 


THURSDAY  AFTERNOON  23  MARCH 

SESSION  20:  FEM  (parallel  with  Sessions  16.  17,  18.  and  19)  (CONT) 

1520  "A  Rationale  for  the  Use  of  Mixed-order  Basis  Functions  Within  Finite 
Element  Solutions  of  the  Vector  Helmholtz  Equation" 

1540  "Finite  Element  Waveguide  Simulator  Techniques" 

1600  "A  Solution  for  Open  Boundary  Electromagnetic  Field  Problems  by 
Mapped  Infinite  and  Virtual  Elements" 

FRIDAY  MORNING.  MARCH  24 

0730  CONTINENTAL  BREAKFAST 


122  Ingersoll  Hall 

A.F.  Peterson  and  D.R.  Wilton 

J.R.  Sanford  and  N.M.  Johansson 
L.H.A.  de  Medeiros  and  A.  Raizer 


SESSION  21:  EM  ANALYSIS  TECHNIQUES  FOR  ELECTRICALLY  LARGE  CAVITIES  (parallel  with  Sessions  22  and  23) 
ORGANIZER:  D.  PFLUG  122  Ingersoll  Hall 


0840  "Application  of  Modal  and  Plane  Wave  Expansions  to  Modeling  Large  J.  L.  Karty  and  J.M.  Roedder 

Jet  Engine  Cavities"  (Invited) 


0900  "Scattering  from  Dielectric  Loaded  Cavities  Using 
Shooting  and  Bouncing  Rays"  (Invited) 

0920  "Xpatch  Simulation  of  Large  Inlet  Structures"  (Invited) 

0940  "An  Iterative  Physical  Optics  Approach  for  the  Em  Analysis  of 
Cavities  and  Other  Multi-Bounce  Geometries"  (Invited) 

1 000  BREAK 

1020  “Improved  Ray  Basis  in  the  Hybrid  Analysis  of  EM  Scattering 
by  Large  Open  Cavities"  (Invited) 


M.  Christensen,  S.  W.  Lee 
D.J.  Andersh 

R.  Bhalla  and  H.  Ling 

R.J.  Burkholder 


R.J.  Burkholder,  P,H.  Pathak, 

H.T.  Chou,  D.  Andersh  and  J.  Path 


1040  "Overlapping  Modal  and  Geometric  Symmetries  for  Computing  Jet  Engine  D.C.  Ross,  J.L.  Volakis, 

Engine  Inlet  Scattering"  (Invited)  H.  T.  Anastassiu  and  D.  Andersh 


LUNCH 


SESSION  22:  ACCURACY  ESTIMATION  IN  ELECTROMAGNETIC  MODELING  (parallel  with  Sessions  21  and  23)  109  Glasgow  Hall 
ORGANIZER:  S.M.  WANDZURA 


0840  "Assessing  the  Influence  of  Coefficient  Accuracy.  Matrix  Condition 
Number,  Size  and  Type,  and  Computer  Precision  on  Matrix-Solution 
Accuracy"  (Invited) 

0900  "Numerical  Accuracy  Issues  in  Finite  Element  Frequency  Domain 
Solutions  of  Radar  Scattering  Problems"  (Invited) 

0920  "Accuracy  in  Computation  of  Matrix  Elements  of  Singular  Kernels" 

0940  "Accuracy  Estimation  and  High  Order  Methods" 

1000  BREAK 

1020  "Accuracy  Issues  in  Time-Domain  CEM  Using  Structured/Unstructured 
Formulations"  (Invited) 

1040  "An  Accuracy  Study  for  the  3D  Hybrid  Finite  Element  Method  of 
Moments  SWITCH  Code"  (Invited) 

1 100  "Modeling  Accuracy  of  3D  Method  of  Moments  Techniques" 

1 120  "Requiring  Quantitative  Accuracy  Statements  in  EM  Data"  (Invited) 
LUNCH 


E.  K.  Miller 


J.  D'Angelo 


S.M.  Wandzura 

L. R.  Hamilton,  J.J.  Ottusch, 

M. A.  Stalzer,  R.S,  Turley, 

J.L.  Visher  and  S.  M.  Wandzura 


V.  Shankar,  W.  F.  Hall 
and  S.  Palaniswamy 

G.E.  Antilla  and  Y.C.  Ma 


M.B.  Gedera,  L.N,  Medgyesi-Mltschang,  R.A, 
Pearlman,  J.M.  Putnam.  D-S,Y,  Wang 

E.K.  Miller 


XXVJ 


SESSION  23:  PDE  METHODS  IN  ELECTROMAGNETICS  (parallel  with  Sessions  21  and  22)  102  Glasgow  Hall 
ORGANIZERS:  R.  LEE  AND  J.  -F.  LEE 


0840  “Optimization  Issues  in  Finite  Element  Codes  for  Solving  Open  Domain  A.  Chatterjee  and  J.L.  Volakis 

3D  Electromagnetic  Problems"  (Invited) 

0900  "A  Characteristic-Based  3D  Time  Domain  Maxwell  Equation  Solver"  J.S.  Shang  and  K.C.  Hill 

(Invited) 

0920  "Finite  Element  Solution  of  Eddy  Current  Problems  in  Electromagnetics"  O.A.  Mohammed  and  G.  F.  Uler 

(Invited) 

0940  "Ten  Years  of  Evolution  of  the  FDTD-like  Conformal  Technique"  (Invited)  K.S.  Yee 

1 000  BREAK 

1020  "Whitney  Elements  Time  Domain  (WETD)  Methods  for  Solving  Three-  J-  -F.  Lee 

Dimensional  Waveguide  Discontinuities”  (Invited) 

1040  "An  FDTD/FVTD  2D-algorithm  to  Solve  Maxwell’s  Equations"  J.S.  Chen,  J.V.  Prodan  and  K.S.  Yee 

1100  "Spectral  Finite  Methods  for  the  Simulation  of  Electromagnetic  A.C.  Cangellaris  and  D.  Hart 

Interactions  with  Electrically  Long  Structures"  (Invited) 

LUNCH 


0830-1 630  SHORT  COURSE  (FULL-DAY)  101 A  Spanage!  Hall 

"Using  Mathematical  Software  for  Computational  Electromagnetics"  Jovan  Lebaric,  Naval  Postgraduate  School 


0830-1630  SHORT  COURSE  (FULL-DAY) 

"Wire  Antenna  Modeling  Using  NEC" 


109  Glasgow  Hall 

Dick  Adler,  Naval  Postgraduate  School 

Jim  Brea  kail,  Penn  State  University 

Gerry  Burke,  Lawrence  Livermore  National  Lab 


0830-1630  SHORT  COURSE  (FULL-DAY)  102  Glasgow  Hall 

"FDTD,  Generalized  FDTD  and  FVTD  Techniques  in  Solving  Maxwell's  Kane  Yee,  Lockheed 

Equations" 


SESSION  1: 

SCATTERING 

Chairs:  V.  Cable,  E.  Miller 
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A  CGFFT  METHOD  APPLIED  TO  THE  SCATTERING 
FROM  FINITE  SIZE  MICROSTRIP  ANTENNA 


A.  McCowen 

Department  of  Electrical  and  Electronic  Engineering 
University  of  Wales,  Swansea,  UK 


Abstract 

Conjugate  gradient  EFT  (CGFFT)  methods  which  solve  the  electric  field  integral  equation 
(EFIE)  have  previously  been  applied  to  microstrip  structures.  However,  the  methods  reported 
assume  an  infinite  planar  substrate  (or  layered  substrate)  and  cannot  be  applied  to  finite 
structures,  or  structures  with  recesses  or  general  inhomogeneity  in  the  substrate. 

This  paper  will  discuss  the  CGFFT  method  which  has  been  used  to  solve  the  EFIE  for 
finite  structures.  Initial  results  from  this  technique  have  shown  good  agreement  with 
conventional  method  of  moment  solutions.for  bistatic  res  and  induced  current  distributions. 

INTRODUCTION 

Conjugate  gradient  FFT  (CGFFT)  methods  which  solve  the  electric  field  integral  equation 
(EFIE),  have  previously  been  applied  to  microstrip  structures  and  several  authors,  eg  [1],  have 
demonstrated  their  efficiency  in  CPU  and  memory.  However,  the  methods  reported  assume  an 
infinite  planar  substrate  (or  layered  substrate)  and  ground  plane  which  are  modelled  by  a 
modified  Green’s  function,  and  are  thus  unsuitable  for  finite  structures  or  structures  with  recesses 
and  general  inhomogeneity  in  the  substrate. 

Based  on  the  author’s  previous  works  [2,  3],  a  CGFFT  method  has  been  developed  to 
cope  with  the  mixed  dielectric/conductor  problem  associated  with  microstrip  of  finite  size  and 
having  arbitrary  planar  substrates  (presently  restricted  to  isotropic).  Similar  structures  have  been 
modelled  in  previous  works,  which  includes  a  recent  paper  by  Sarkar,  Rao  and  Djordjevic  [4], 
but  these  works  have  been  based  on  the  Method  of  Moments  (MoM)  for  mixed 
conductor/dielectric  problems  which  is  well-known  to  be  prohibitively  expensive  in  CPU  and 
computer  storage  for  scatterers  in  the  resonance  region. 

This  paper  will  discuss  the  CGFFT  method  which  has  been  used  to  solve  the  EFIE  for 
finite  size  microstrip  structures,  and  show  initial  results  from  the  scheme.  The  results  of  bistatic 
res  and  current  distribution  on  the  scatterers  will  be  given  and  compared  to  the  results  of  the 
method  of  moments  scheme  based  on  the  combined  field  integral  equation  (CFIE)  for  mixed 
dielectric/conductor  problems  which  was  developed  by  Rao,  Wilton  and  Glisson  [5]  and  was  also 
used  in  [4]. 


2 


THE  CGFFT  METHOD 


The  problem  domain  comprises  2  subdomains  S  and  V.  Firstly  S  is  the  set  of  N  two- 
dimensional  pec  surfaces  stacked  parallel  to  the  x-y  plane,  each  separated  by  Az  and  discretised 
by  Ax  and  Ay  in  the  usual  way  for  CGFFT  methods.  Secondly  the  subdomain  V  is  an 
inhomogeneous  dielectric  which  is  uniformily  discretised  into  cells  of  size  AxAyAz  which  are 
interleaved  between  the  N  pec  surfaces  of  the  subdomain  S,  see  Fig.  1.  Each  cell  is  assumed  to 
have  a  constant  value  of  relative  permittivity.  If  Js  and  Jv  are  the  current  continue  distributions 
on  the  pec  surfaces  and  the  polarization  currents  respectively  then  the  EFIE  takes  the  form 


(1) 


where  Es'  and  Ev'  are  the  known  incident  electric  fields  on  S  and  V  respectively.  The  Green’s 
dyadic  terms  in  the  impedance  matrix  represent  the  self  and  mutual  coupling  between  the 
subdomains.  It  should  be  noted  that  Gs.s  is  determined  by  a  straight  forward  extension  of  the 
formulation  for  resonant  pec  plates  given  in  [2]  in  which  the  null  electric  boundary  condition  is 
applied  to  S  and  Gw  is  as  determined  in  [3]  for  a  3D  inhomogeneous  dielectric  scatterer.  The 
cross-coupling  terms  are  similar  to  those  in  G.,,  and  Gw  but  are  interpolated  in  the  way  defined  by 
Shen  [51  who  applied  an  FFT-based  algorithm  to  two  flat  pec  plates  to  fonn  a  dihedral  reflector. 
A  conjugate  gradient  FFT  based  algorithm  is  then  used  to  drive  the  left-hand  side  of  (1)  to  be 
equal  to  the  known  right-hand  side. 


Fig.  1  The  domain  of  the  scheme  showing  the  subdomains  S,  a  set  of  pec  plates,  and  V  layers  of 
dielectric. 
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RESULTS 


A  typical  example  of  the  results  to  be  shown  is  now  given.  Fig.  2  shows  a  rectangular 
patch  antenna  of  width  W  and  length  L  with  a  feed  line  of  width  w  and  length  1,  over  a  finite 
ground  plane.  For  this  case  the  structure  comprises  the  subdomain  S  which  is  the  set  of  2  pec 
plates,  one  of  which  is  the  ground  plane,  and  the  subdomain  V,  which  comprises  a  single  layer  of 
dielectric  of  thickness  h  (=  Az).  It  should  be  notefd  that  the  scheme  is  flexible  enough  to  take  a 
similar  structure  with  a  thick  substrate,  ie  one  in  which  H  -  (N-1).  Az). 


The  far-field  in  the  x-z  plane  is  shown  in  Fig.  3  for  the  structure  as  defined  in  Table  1  and 
illuminated  by  a  plane  wave  at  nonnal  incidence  and  polarised  along  the  x-asis.  The  results  of 
the  present  scheme  are  compared  to  the  results  from  the  method  of  moments  schemes  [4]  and  are 
shown  to  be  in  excellent  agreement  with  Ax  =  Ay  =  0.0625 and  Az  =  0-0125Xo,  1560  unknowns 
are  generated  for  this  case  and  the  storage  requirement  for  the  impedance  matrix  in  the  spectral 
domain  is  0.7Mbytes.  With  the  initial  estimate  of  the  current  density  taken  to  be  zero  the 
interative  scheme  took  400  iterations  to  converge  with  a  relative  error  tolerance  of  0.025.  The 
cpu  taken  to  generate  the  solution  was  approximately  2  sec.  per  iteration  on  a  Dec  Alpha  2100. 
The  generation  of  the  impedance  matrix  is  a  negligible  proponion  of  the  solution  time. 


Table  1  Data  for  the  structure 

Substrate  Patch  and  feed 


er  =  2.56 
h  =0.0125>.o 
Lg=  1.25A,o 

Wg  =  0.0375>,o 


L  =  W  =  0.3125^o 
w  =  0.0625A.O 

1  =0.3125>.o 


Substrate 


Fig.  2  Geometry  of  the  microstrip  structure. 
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e 

Fig.  3  Far  field  (20  log|EJ  versus  0  in  the  Z,-x  plane  for  the  geometry  and  data  of  Fig,  2  and 

Table  1.  Results  are  compared  to  those  from  [4]. 
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Problems  of  EM  scattering  by  atmospheric  hydrometeors  are  very  in^rtant  via 
plural  practical  ^jplications.  Under  tracing  atmosphere  by  \^ether  radars  the  work 
wavelength  happens  to  be  very  sensitive  for  the  presence  of  atmosj^ieric  particles 
by  virtue  such  effects  as  attenuation  of  propagating  field,  scattering,  depolarization 
effects  and  so  on  arise.  Discrete  Sources  Method  (DSM)  seeim  to  be  very 
appropriate  for  analysis  of  such  conq)lex  scattering  jM'oblems  as  multiple  scattering 
by  a  cluster  of  stremg  interacting  ncxi-spherical  particles  [1],P]. 

In  the  fiame  of  DSM  the  scattered  field  is  represented  as  a  superposition  of 
muhipoles^  fields.  It  satisfies  to  Maxwell’  equations  and  Silver-MuUer  radiation 
conditions  exactly.  Multipoles  amplitudes  needs  to  be  determined  iBrom  the 
boundary  cemdition  at  the  obstacle  surface.  If  the  obstacle  represents  an  aggregate 
of  axi-symmetric  particles  it  is  found  to  be  possible  to  examine  con^lete 
mathematic  models  on  the  basis  of  GMT.  To  sinqjlify  the  problem  we  employ 
azimuthal  Fourier-series  expansion  both  for  scattering  and  exited  fields  for 
reduction  3-D  space  scattering  problem  to  the  set  of  the  approximating  problems  at 
the  surface  element  of  an  obstacle. 

We  use  the  special  construction  for  EM  fields  of  the  multipoles.  Multipoles  are 
deposited  in  the  (xmplex  plane  joining  to  the  axis  of  symmetry.  The  harmonics  of 
the  multipoles  can  be  represented  as  a  finite  linear  combination  of  the  elementary 
functions.  Multipole’  amplitudes  for  each  harmonic  should  be  determine  as  a 
pseudosolution  of  the  overdetermined  linear  system  obtained  fitwn  the  point- 
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matching  ^pproadL  It  allows  to  get  a  minimum  of  the  mean-square  norm  for  the 
residual  of  £M  fields  at  the  obstacle  surface.  The  efficiency  of  the  DSM  code 
allows  to  employ  any  kind  of  perscmal  cixnputers.  One  is  able  to  investigate  an 
obstacle  having  the  total  meridian  length  up  to  10  exciting  wavelengths  and  to 
estimate  the  error  of  the  result  obtained. 

This  technique  was  ^>plied  to  the  analysis  of  the  scattering  of  TM/TE  plane  wave 
(wavelength  was  3.2  cm)  by  atmospheric  hydrometeors.  The  hydrcmieteor  sh^ 
was  chosen  following  to  the  real  hydrometeor  forms  [3].  Conq>arison  of  scattering 
diaracteristics  for  the  single  particle  with  results  obtained  from  spherical 
ai^oximation  using  for  the  particle  sh^>e  was  perfixmed.  We  have  established  the 
limits  of  si^erical  particles*  approximation  for  atmosj^ere  hydrometeors  [4]. 

The  investigation  of  wave  scattering  using  the  dynamic  model  was  performed.  It 
allows  to  calculate  the  multiple  scattering  by  the  cluster  of  strong  int^acting  non- 
spherical  particles  [5].  The  particles  differing  one  fix)m  others  both  by  shape  and 
permittivity  can  be  used  as  elements  of  the  cluster.  This  model  seems  to  be  very 
usefiil  imder  investigation  of  the  wave  propagation  through  an  active  atmosj^b^e 
**meltmg  layer*'  in  that  hydrometeors  are  in  the  state  of  fragmentation  and 
coaguMon.  Results  of  ccm^lete  EM  analysis  of  sudi  syst^ns  show  the  essentially 
different  medianism  of  scattering  by  the  cluster  of  strong  interacting  particles  than 
by  non-interacting  ones. 
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Abstract 

A  theoretically  exact  solution  which  determines  the  scattered  fields  due  to  TM  incidence  of  an 
electric  plane  wave  on  an  infinitely  long  (no  z  variation)  perfectly  electrically  conducting  cylinder 
surrounded  with  variable  layers  of  lossy  dielectric  materials  is  presented.  This  solution  is  used  to 
calculate  the  radar  cross-section  (RCS)  of  an  un-coated  cylinder,  a  cylinder  with  a  lossy  dielectric 
coating  where  the  constitutive  parameters  are  assumed  to  be  invariant  with  frequency,  and  cylinders 
where  the  constitutive  parameters  of  the  lossy  coating  vary  with  frequency. 


I  Introduction 

On  naval  vessels,  and  other  self  contained  environment,  the  proximity  of  RF  transmit  ter /receiver  equip¬ 
ment  to  conduction  cables  may  produce  undesirable  reflections.  Primarily  an  electromagnetic  compati¬ 
bility  (EMC)  problem,  this  investigation  presents  theoretical  data  demonstrating  the  potential  for  lossy 
materials,  also  known  as  radar  absorbant  material  (RAM),  to  reduce  unwanted  reflections  as  opposed  to 
the  difficult  task  of  determining  a  physical  configuration  which  is  designed  to  minimize  these  problems. 
The  method  chosen  for  this  effort  is  the  exact  series  solution  due  to  the  geometry  of  the  problem  as  well 
as  the  obvious  numerical  efficiency  advantages.  The  exact  series  solution  require  only  a  fraction  of  the 
time  for  calculation  as  the  MOM  or  TLM.  In  addition,  unlike  the  MOM  or  TLM,  the  exact  series  solution 
is  theoretically  exact.  Only  the  TM  case  is  presented  in  this  paper,  but  the  techniques  presented  here 
can  be  used  to  develop  the  TE  scattered  field  formulation  and  RCS. 

II  Analytic  Formulation 

Shown  in  Figure  1  is  a  TM^  electric  field  incident  on  an  infinitely  long  (no  z-variation)  wire  with  N 
concentric,  dielectric  coatings  (regions  1  to  N)  where  each  region  has  it’s  own  constitutive  parameters 
(SrrMrj  ^ud  k^)-  From  Harrington  [1]  we  know  that  the  total  electric  field  for  the  N-fl  region  (free 
space)  is  defined  as 

EU+i  =  Ei,jv+1  +  =  Eo  E  r"(^( W^>)  +  (1) 

n=  — OO 

and  for  regions  m<N,  the  total  electric  field  is  defined  as 

•foo 

K.m  =  Ei,„  +  =  Eo  E  (2) 

n=  — oo 
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Figure  1:  Geometry  of  Problem.  Circular  PEC  cylinder  coated  with  N  layers  of  dielectric  material. 


where  the  subscript  m  denotes  the  region,  E*  is  the  z- component  of  the  total  electric  held,  E*  and 
are  the  incident  and  scattered  z-directed  helds  respectively,  and  Eq  is  the  magnitude  of  the  incident 
held.  Jn  is  the  Bessel  function  of  the  hrst  kind,  is  the  Neumann  function,  and  represents  the 
Hankel  function  of  the  second  kind.  An,  and  Cn,m  are  all  coeihcients  modifying  the  Bessel  functions. 
Solving  for  the  coefficients  is  necessary  to  determine  the  scattered  held,  and  thus  the  RCS.  In  order  to 
do  this,  we  use  the  boundary  conditions 


KRe,ionm{P  =  Pl)  =  0^^  (3) 

^l,Regicm  m{p  =  Pm+l)  =  ^^Region  m+liP  =  Pm+l)  for  all  pm  2<m<N+l  (4) 

Region  miP  ~  Pm+l)  =  Region  m+l(P  =  /^m+l)  aU  pm  2<m<N  +  l  (5) 

Application  of  Maxwell’s  equations  and  these  boundary  conditions  to  1  and  2  give  us  the  following 
solution  for  an 

_  _(  Jn(fei\r+lPj^+l)  ~  Jn{^N^\PN+\)Dn,N^-\  \ 

\H^\kN+lpN+l)  -  S^^\kN+lpN-hl)Dn,N+l) 


where 


and 


Dn 


0 

— 1  — 1  Pm)~l~  -PniW— 1  iPm) 

Vm  Pm.)+Fn,7n-l 


if  m=l 
otherwise 


f  Jnj^mpTTi)  Jn(^TnPm)-Pn,m  \ 
yNn{kmPm)  ~  E^ni^rnPm)^  n,m  j 


(7) 

(8) 


In  order  to  determine  an,  must  be  solved.  This  can  then  be  used  to  solve  for  Dn,2  which  is  used  to 
solve  for  Enz-  This  process  is  repeated  until  Dn,N-\-i  Is  determined,  at  which  point  an  can  be  calculated. 
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Once  the  coefficient  0^  has  been  determined,  the  scattered  held,  E^,  is  known,  and  so  the  RCS  can  be 
calculated.  The  formula  for  the  RCS  is  defined  as 

RCS  =  Hm  (9) 

P-.00 

E*  is  generally  give  as  unity,  so  using  the  asymptotic  formula  of  the  Hankel  function  of  the  second  kind 
reduces  the  formula  to  an  easily  calculable  form.  The  formulas  presented  here  were  used  to  calculate  the 
RCS  when  the  coating  is  lossless  for  various  cases,  and  compared  against  other  methods  [2]an(l  [3],  and 
shown  to  be  in  excellent  agreement. 

Ill  Numerical  Results 

In  this  section  analytic  data  is  presented  showing  the  effects  of  a  RAM  coating  on  a  right  cylinder.  The 
frequency  dependence  of  the  relative  permittivity  and  permeability  of  each  material  is  shown  in  Figure  2 
and  the  corresponding  RCS  in  Figure  3  (RCS  of  cylinder  without  RAM  also  shown).  Several  methods 
for  presenting  the  RCS  results  were  considered  and  a  contrasting  of  the  behavior  of  each  material  to  an 
un-coated  wire  was  most  appropriate  (Figure  3) 


Figure  2:  Real(-)  and  imaginary  permittivity  and  permeability  of  RAM  material  over  8  to  12  Ghz  range. 

The  use  of  frequency  independent,  but  stiU  lossy,  RAM  (s  and  fi  frequency  independant)  was  included 
for  the  purpose  of  an  additional  comparison  of  the  RAM  material  and  is  not  representitive  of  any  known 
RAM.  However,  because  the  use  of  a  single  value  for  e  and  fi  for  a  given  RAM  commonly  occurs  for  a 
narrow'  band  analysis  of  the  RCS,  both  e  and  fi  were  selected  accordingly.  Mid- band  values  (  freq  = 
10.28  Ghz)  were  used  as  e  and  }jl  for  the  frequency  invariant  RAM,  and  assumed  to  be  constant  over  the 
entire  X-band.  Figure  3  clearly  shows  that  the  RCS  versus  frequency  dependance  is  not  strongly  effected 
by  the  use  of  the  average  value  of  the  RAM  constituitive  parameters.  This  indicates,  at  least  for  the 
cases  considered,  that  an  approximate  or  average  value  of  certain  parameters  yield  generally  acceptable 
results.  Figure  3,  considering  the  permitivity  and  permeability  to  be  constant  over  this  frequency  range 
gives  comparable  results  to  the  plot  of  the  RCS  when  the  parameters  vary  with  frequency.  Both  radar 
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Figure  3:  RCS/A  of  (a)  5/16  and  (b)  3/16  inch  diameter  wire  with  no  coating  with  a  2.543  mm 
coating  where  the  constitutive  parameters  are  constant  £^=9-51 14-10.8679  /Zr =0.7326-10.2186  (-  -),  and 
with  a  2.543  mm  thick  coating  with  the  frequency  dependent  parameters  shown  in  Figure  2. 

absorbing  materials  give  the  maximum  attenuation  at  approximately  the  same  frequency,  but  for  the 
5/16  inch  wire,  the  frequency  invariant  RAM  results  have  a  minimum  RCS  approximately  3  dB  lower 
than  the  frequency  dependent  RAM  data.  Also,  for  the  8.5  to  9.5  Ghz  range,  the  frequency  dependent 
RAM  shows  a  smaller  RCS.  For  the  3/16  inch  wire,  the  frequency  dependant  RAM  give  a  smaller  RCS 
(«7  dB)  for  the  low  end  of  the  X-band.  This  indicates  that  for  a  high  degree  of  accuracy,  the  frequency 
dependence  of  the  constituitive  parameters  must  be  taken  into  account. 

It  is  also  shown  in  Figure  3  that  the  frequency  dependent  RAM  gives  maximum  attenuation  at 
the  frequency  corresponding  to  a  quarter  wavelength  in  the  material  for  the  TM  case.  Figure  4  shows 
three  RCS  curves  for  varying  thicknesses  of  the  frequency  independent  RAM  coating-  The  peak  value 
of  attenuation  occurs  at  frequencies  corresponding  to  a  RAM  thickness  of  A/4.  A  comparison  of  the 
frequency  varying  RAM  curves  is  shown  in  Figure  4. 

IV  Measurements 

The  measurements  for  the  experimental  data  were  conducted  using  a  Naval  Research  Laboratory  (NRL) 
arch  arrangement  configured  with  dual- ridged  broadBand  transmit  and  recieve  horns  and  a  Hewlett 
Packard  8510  B  Network  Analyzer  with  the  time  domain  windowing  option.  The  windowing  option  was 
used  to  isolate  the  power  returns  from  the  bare  and  RAM  coated  wires.  In  all  cases  the  return  from  the 
bare  and  RAM  coated  wire  was  significantly  higher  than  any  system  noise,  and  therefore  a  high  degree 
of  accuracy  can  be  assumed  for  the  measurements.  Samples  of  the  RAM  material  were  analyzed,  and  the 
scattering  parameters  were  calculated  using  the  HP  analyzer.  The  constituitive  parameters  were  then 
extracted  from  the  scattering  matrix.  Several  samples  were  taken  at  different  locations  on  each  wire  as 
a  test  for  anisotrophies. 
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Figure  4:  RCS/A  of  a  3/16  inch  diameter  wire  with  a  2.5  mm  RAM  coating  (oo),  2.75  mm  RAM 
coating  (-),  and  3.0  mm  RAM  coating  (-  -).  The  constitutive  parameters  are  constant  £,.=9.51 1+iO. 8679 
/ir=0.7326-i0.2186. 

V  Conclusions 

Careful  consideration  must  be  taken  with  the  values  of  the  constituitive  parameters  when  modeling 
radar  absorbing  materials  on  conductors  at  high  frequencies.  Since  variations  in  these  parameters  may 
significantly  alter  the  calculated  attenuation  due  to  the  concebtric  RAM  layer,  attention  must  be  given  to 
obtain  data  reflecting  the  frequency  dependence  of  complex  permeability  and  permitivity  if  a  high  degree 
of  accuracy  is  required.  All  simulations  in  the  research  showed  good  agreement  with  the  measurements. 
A  peak  attenuation  of  20-25  decibels  was  shown  in  the  simulated  case.  This  large  value  of  attenuation 
was  found  to  occur  at  frequencies  corresponding  to  a  RAM  thickness  of  A/4.  It  was  also  shown  that  an 
average  value  of  £  and  /x  of  the  actual  RAM  (frequency  dependent)  could  be  used  for  approximate  RCS 
values  over  a  narrow  frequency  range  for  the  TM  mode  considered. 
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ABSTRACT 

In  this  paper,  the  three-dimensional  time  domain  Transmission  Line  Matrix  (TLM)  method  has 
been  applied  to  compute  the  RCS  of  dielectric  cubes  of  relative  permittivity  37.84  and  79.46,  and 
comparison  has  been  made  with  published  measurements  [1].  Since  the  high  permittivity  cubes 
ring  for  a  very  long  time,  very  good  quality  absorbing  boundary  conditions  having  long  term  sta¬ 
bility  are  required.  We  have  achieved  these  by  modifying  Higdon’s  absorbing  boundary  condi¬ 
tions.  Long  term  stability  has  been  obtained  by  using  proper  discretization  of  the  boundary 
operators  (derivatives),  and  very  low  reflections  have  been  obtained  by  concatenating  two  first- 
order  boundary  operators.  We  have  obtained  stability  beyond  60,000  time  steps  by  placing  the 
absorbing  boundaries  just  20  cells  away  from  the  cube  surfaces  (cube  size  is  15  cells).  The  RCS 
data  computed  with  the  TLM  method  agree  very,  well  with  the  measurements  published  in  [1]. 
The  frequency  shift  between  the  TLM  results  and  the  measurements  is  negligible,  while  a  consid¬ 
erable  frequency  shift  has  been  reported  between  results  obtained  with  FDTD  and  measurement 
[!]• 

1.  INTRODUCTION 

The  Transmission  Line  Matrix  (TLM)  method  is  a  numerical  technique  in  which  both  space  and 
time  are  discretized  [2].  The  simulation  of  propagation  of  electromagnetic  waves  is  done  through 
scattering  of  impulses  in  a  3-D  meshed  network  of  transmission  lines.  This  method  is  suitable  for 
computation  of  radar  cross-section  (RCS)  of  complex  bodies.  The  RCS  over  a  wide  frequency 
bandwidth  can  be  obtained  from  a  single  TLM  simulation.  Earlier  studies  on  computation  of  RCS 
using  the  TLM  method  have  concentrated  mainly  on  perfectly  conducting  targets  [3-6].  This 
paper  studies  the  scattering  properties  of  high  permittivity  dielectric  cubes.  As  these  cubes  ring  for 
a  very  long  time,  highly  stable  absorbing  boundary  conditions  are  required  for  a  TLM  analysis  as 
compared  to  the  perfectly  conducting  targets.  Eventhough  a  number  of  absorbing  boundary  condi¬ 
tions  have  been  reported  in  the  literature,  Higdon’s  absorbing  boundary  conditions  [7]  have  been 
found  to  perform  better  than  others  [8].  The  absorption  properties  in  the  required  frequency  range 
can  be  optimized  by  taking  advantage  of  the  prior  information  about  the  incident  angles  and  by 
combining  several  first-order  boundary  operators  [9-10].  Also,  the  long  term  stability  can  be 
obtained  by  choosing  the  proper  finite  differences  for  the  boundary  operators. 

We  have  used  the  symmetrical  condensed  node  (SCN)  TLM  [11]  (shown  in  Fig.l)  for  obtaining 
the  scattered  tangential  electric  and  magnetic  fields  on  the  fictitious  current  surfaces.  The  advan¬ 
tages  of  this  node  when  compared  to  the  expanded  TLM  node  and  Yee’s  FDTD  node  are  the  fol¬ 
lowing:  boundary  description  is  easier,  and  all  six  field  components  can  be  defined  at  single  points 
in  space.  It  has  six  branches,  each  branch  consisting  of  two  uncoupled  two-wire  transmission 
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lines.  The  12  transmission  lines  linking  the  Cartesian  mesh  of  nodes  together  have  the  characteris¬ 
tic  impedance  of  free  space.  Each  line  is  associated  with  two  fields.  For  example,  a  voltage 
impulse  incident  upon  the  port  3  is  associated  with  the  field  quantities  Ey  and  H^. 


2.  ABSORBING  BOUNDARY  CONDITIONS 

To  obtain  good  absorption  over  a  wide  range  of  incident  angles,  we  have  concatenated  two  first- 
order  Higdon’s  boundary  operators  to  obtain  a  second-order  absorbing  boundary  condition  [9].  A 
voltage  impulse  reflected  from  the  absorbing  boundary  can  be  computed  from  the  knowledge  of 
impulses  in  the  cells  in  front  of  the  boundary  using  the  following  equation: 

V'(m,i,k)  =  (m,i,k)-a^a^V'~\m,j,k)  +  (Pj  +  P^)  v"  (m- 1  J.i) 

+  “  (ajY2  +  Yia2)V"”^(m-lJ,/:) 

-P  j P2  V"  (m  -  2, 7,  i)  -  ( P ^72  +  Yi P2)  V"  “  ‘  (m  -  2, 7,  <:)  -  Yi Y2 ^  C'"  -  2.  A  ^ 

The  interpolation  coefficients  are: 

{a-g.{\~b))  ^  ^  (-a~bg.) 

“/  =  (a-i-j.(i-6)-e,A/)’l^r-  "  (a-l-g,.(l-fe)-e.A/)’^<-  "  (a  -  1  -  ( 1  -  fc)  -  e.A/) 


where  coefficients  a  and  b  are  weighted  time  and  space  averages  of  the  space  and  time  differences, 
respectively.  Sj  and  82  are  damping  factors.  The  parameter  g^  is 


cose. 
c  At 


(3) 


where  A1  and  At  are  the  space  resolution  and  time  step  respectively.  0|  are  the  incidence  angles. 
For  the  symmetrical  condensed  node,  g^  becomes  g.  =  2  cosS .. 
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The  values  of  a  and  b  can  be  chosen  to  control  the  stability  of  the  absorbing  boundaries.  Accord¬ 
ing  to  Higdon  [7],  a  must  be  less  than  or  equal  to  0.5  (for  a=^b)  to  get  stable  absorbing  boundary 
conditions. 

3.  NEAR-FIELD  TO  FAR-FIELD  TRANSFORMATION 

The  near-field  to  far-field  transformation  technique  based  on  the  equivalence  principle  is  well 
known.  According  to  this  principle,  the  far  electromagnetic  field  components  can  be  computed  if 
the  tangential  electric  and  magnetic  fields  on  a  closed  fictitious  surface  surrounding  the  scatterer 
are  known.  If  the  fields  outside  of  the  closed  surface  are  Eg  and  then  the  surface  electric  and 
magnetic  current  densities  are 


=  M^  =  -hxE^,  (4) 

where  h  is  the  local  surface  unit  normal.  These  current  densities  can  be  calculated  using  the  TLM 
method.  These  should  be  calculated  at  the  required  number  of  frequencies  and  stored  for  further 
processing.  After  the  current  densities  have  been  computed,  the  electric  and  magnetic  vector 
potentials  can  be  calculated  from  the  following  equations: 


\le  r  sin9cos(J)  +  yjSin9sin(})  +  z^cos9) 

A  =  — 1 - \J^e  as  (5) 

4TZr  J  s 
s 

r  //:(JC,sin9cos(!>  +  >,sin9sin({)  +  z  cos9) 

F  =  — -  M  e  "  ^  ^  ds  (6) 

4Ttr  J  ^ 
s 

where  Xg,  yg,  and  Zg  are  the  co-ordinates  of  the  source  point  on  the  current  surface.  0  and  ^  are  the 
spherical  co-ordinates  of  the  far  point.  The  far-field  electric  components  can  be  computed  from 
the  knowledge  of  electric  and  magnetic  vector  potentials: 

£0  =  +  E^  =  -j(o{A^~r\FQ),  (7) 

where  T|  is  the  free  space  impedance. 

4.  NUMERICAL  RESULTS 

The  dielectric  cube  of  size  8.97  mm  and  permittivity  37.84  was  discretized  into  15  SCN-TLM 
cells  on  each  side.  The  space  resolution  and  time  step  were  0.598  mm  and  0.9966  ps,  respectively. 
A  plane  wave  with  a  electric  field  component  Ey  and  magnetic  field  component  was  incident 
on  one  face  of  the  cube  normally  (shown  in  Fig.  2).  This  was  achieved  in  the  symmetrical  con¬ 
densed  node  by  launching  the  impulses  on  branch  3.  We  have  used  a  cosine  modulated  Gaussian 
pulse  (shown  in  Fig.  3)  as  the  excitation  to  make  sure  that  only  the  frequencies  of  interest  were 
excited.  This  also  ensured  that  the  D.C.  and  very  low  frequencies  were  not  excited,  otherwise,  the 
absorbing  boundaries  could  not  be  placed  close  to  the  cube  surface.  The  excitation  Gaussian  pulse 
width  (corresponding  to  -34  dB)  was  127  At.  The  values  of  the  incidence  angles  used  in  the  design 
of  the  absorbing  boundaries  were  45°  and  85°.  The  absorbing  boundaries  were  placed  20  cells 
away  from  the  cube  surfaces.  The  time  response  of  the  Ey  field  for  two  sets  of  coefficients  a 
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Fig.  2:  The  high  permittivity  cube  with  incident  electric  field  Ey 

(weighted  time  average  of  the  space  difference)  and  b  (weighted  space  average  of  the  time  differ¬ 
ence)  are  plotted  in  Figs.  4a  and  4b.  Note  that  the  oscillations  start  around  2500  time  steps  for  the 
case  ^=^7=0.25,  while  there  is  no  stability  problem  for  a~b-~0. 25.  Hence,  we  have  used  a=^=-0.25 
in  all  our  computations.  Stability  beyond  60,000  time  steps  has  been  obtained  in  both  dielectric 
cubes  of  permittivity  37.84  and  79.46  with  just  20  cells  separating  the  cube  edges  and  the  absorb¬ 
ing  boundaries,  while  growing  oscillations  beyond  32,768  time  steps  have  been  reported  in  the 
FDTD  analysis  of  the  cube  of  permittivity  79.46  using  Mur’s  absorbing  boundaries  (in  spite  of 
twice  the  spacing  of  40  cells  between  the  cube  edges  and  the  absorbing  boundaries). 

The  computed  RCS  for  15,000  TLM  time  steps  is  plotted  in  Fig.  5.  When  compared  with  the  mea¬ 
sured  results  (Fig.  6),  it  can  be  seen  that  the  convergence  is  not  yet  reached.  Fig.  6  shows  the  RCS 
data  computed  with  60,000  time  steps.  They  compare  well  with  the  measurements. 

Next,  we  have  computed  the  RCS  of  a  cube  of  size  7.72  mm  and  £j.=79.46  [1].  Again,  the  same 
TLM  discretization  as  above  (i.e.,  15  cells  along  the  length  of  the  cube  and  20  cells  between  the 
cube  edge  and  the  absorbing  boundaries)  was  used.  The  space  resolution  and  time  step  were 
0.5147  mm  and  0.8578  ps,  respectively.  The  width  of  the  cosine  modulated  Gaussian  pulse  was 
148  At,  Fig.  7  compares  the  computed  RCS  (using  60,000  TLM  time  steps)  and  measurement. 
They  compare  well  except  for  a  slight  shift  in  the  TLM  results  towards  lower  frequencies  because 
of  the  coarseness  error. 

5.  CONCLUSIONS 

The  RCS  of  high  permittivity  dielectric  cubes  has  been  obtained  using  the  TLM  method.  The 
results  agree  very  well  with  the  measurements  published  in  [1].  While  the  frequency  shift  between 
the  TLM  results  and  the  measurements  is  negligible  for  ej.=37.84,  it  is  very  small  in  the  case  of 
8j.=79.46.  a  much  larger  shift  has  been  reported  in  the  FDTD  analysis. 

Our  study  shows  that  Higdon’s  absorbing  boundaries  have  superior  absorption  and  long  term  sta¬ 
bility.  Also,  they  are  very  efficient  since  they  need  to  be  placed  only  20  cells  away  from  the  cube 
surfaces. 

The  frequency  domain  near-field  to  far-field  transformation  technique  requires  huge  disk  space  to 
store  the  tangential  electric  and  magnetic  fields  on  the  fictitious  current  surfaces  (of  the  order  of 
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100  Mbytes  for  100  frequency  points).  One  way  of  avoiding  this  is  to  compute  the  far-fields 
directly  in  the  time  domain  as  done  in  [12],  In  that  case,  Prony’s  method  can  be  applied  to  esti¬ 
mate  the  future  time  response  from  a  short  initial  time  response,  leading  to  a  very  fast  TLM  analy¬ 
sis.  Our  future  work  will  concentrate  on  these  techniques. 
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Fig.  3:  Cosine  modulated  Gaussian  temporal  profile  for  the  plane  wave  excitation 


Cube  side  length/A^ 


Fig.  5:  RCS  computed  with  15,000  TLM  time  steps  (cube  side  length  =  8.97 
tj.  =  37.84,  bistatic  angle  =  8  degrees) 


Fig.  6:  RCS  computed  with  60,000  TLM  time  steps  (cube  side  length  =  8.97  mm, 
=  37.84,  bistatic  angle  =  8  degrees) 


Cube  side  length/X,^ 

Fig.  7:  RCS  computed  with  60,000  TLM  time  steps  (cube  side  length  =  7.72  mm, 
8r  =  79.46,  bistatic  angle  =  8  degrees) 
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ABSTRACT 

An  efficient  computer  algorithm  has  been  developed  to  predict  accurate  three-dimensional 
backscattering  RCS  levels  from  antenna/array  installations  or  panels/doors  on  an  electrically  large 
curved  surface  such  as  aircrafts  and  missiles.  The  new  algorithm  significantly  reduces  the 
computation  time  of  a  scattering  prediction  code(UFIM)  while  maintains  the  same  accuracy  of  the 
prediction.  This  paper  describes  the  new  process  which  was  incorporated  into  the  UFIM  code  and 
demonstrates  its  enhanced  capability  as  an  efficient  design  tool  for  the  practical  antenna/anray 
installations  or  panels/doors  on  a  large  curved  scattering  object. 


INTRODUCTION 

The  electromagnetic(EM)  contribution  from  antenna/array  installations  or  panels/doors  is  a 
matter  of  increasing  concern  in  radar  scattering,  especially  for  low  observable(LO)  vehicle  design. 
Accurate  RCS  prediction  of  those  complex  discontinuities  on  a  large  scattering  object  is  difficult  due 
to  lack  of  numerically  efficient  three-dimension al(3-D)  analytic  solutions  or  too  many  unknowns  for 
matrix  inversion.  Measurement  approaches  are  also  limited  due  to  quiet- zone  size  of  indoor  RCS 
measurement  ranges  and  the  high  cost  of  the  full-scale  test  model.  Last  year  the  Uniform  Field 
Integration  Method(UFIM)[l]  was  introduced  as  an  efficient  computational  technique  to  accurately 
predict  3-D  RCS  levels  from  a  contour  along  a  piecewise  continuous  curve  on  a  planar  surface.  The 
method  utilizes  simple  two-dimensional(2-D)  scattered  field  formulations.  The  contour  was  defined 
by  any  localized  discontinuities  of  arbitrary  cross  section  such  as  small  step,  slit,  groove,  dielectric 
filled  gap,  bump,  impedance  discontinuity,  and  coated  or  conducting  wedges.  The  UFIM  is  a  hybrid 
approach,  bridging  the  gap  between  high  frequency  techniques  for  electrically  large  objects  and  low 
frequency  techniques  for  electrically  small  objects.  This  paper  describes  an  actual  application  of  the 
UFIM  to  practical  design  problems  of  antenna/array  installations  and  panels/doors  on  a  large  curved 
scattering  object  and  its  significant  improvement  in  computational  efficiency. 
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NEW  UFIM  PROCESS 


A  typical  UFIM  analysis  process  on  a  curved  surface  is  shown  in  Figure  1.  The  contour  on  a 
smooth  curved  surface  can  be  approximated  with  a  piecewise  continuous  curve  if  the  size  of  each 
piecewise  section  is  very  small  (e.g.,  less  than  0,1X).  Then  the  total  3-D  RCS  levels  from  the  contour 
can  be  obtained  by  summing  3-D  scattering  contributions  from  aU  piecewise  sections  along  the 
contour.  The  3-D  scattering  contribution  from  each  piecewise  section  can  be  obtained  by  an  UFIM 
process.  The  UFIM  process  is  very  efficient  for  predicting  3-D  RCS  levels  from  any  discontinuities 
on  a  surface,  because  only  2-D  scattered  fields  from  the  discontinuity  are  required  instead  of  3-D.  It 
is  noted  that  some  assumptions  made  in  the  UFIM  analysis  process  are  listed  in  the  UFIM  Features 
in  Figure  1.  The  simple  2-D  scattered  fields  may  be  obtained  by  full  wave  computations  such  as 
method  of  moments  (MOM),  Finite  Element  Method(FEM),  Finite  Difference  Time  Domain(FDTD), 
or  Transmission  Line  Matrix (TLM),  or  by  simple  planar  measurements [2].  The  UFIM  is  an  efficient 
2-D  to  3-D  conversion  process  and  calculates  the  3-D  backscattering  RCS  directly  from  the  2-D 
scattered  field  of  the  contour.  However,  in  the  course  of  computing  the  scattering  from  antenna/array 
installations  on  a  electrically  large  curved  surface,  numerous  solutions  to  the  2-D  problem  are  needed 
due  to  many  piecewise  sections  simulating  the  surface  curvature.  That  results  in  the  major 
computation  time  because  most  2-D  solutions  are  numerically  rigorous  but  very  computationally 
extensive.  Therefore,  a  new  algorithm  has  been  developed  and  implemented  into  the  UFIM  code  to 
reduce  the  burden  of  massive  2-D  scattered  field  computation.  The  reduced  computation  time  is  a 
result  of  the  fact  that  the  cross  sectional  dimension  of  a  discontinuity  is  small  compared  to  a 
wavelength  (e.g.,  typically  less  than  O.IX).  Figure  2  shows  the  magnitude  and  phase  plots  of  the  2-D 
scattered  field  from  a  groove  (width  =  0.15cm,  depth  =  0.1 1cm)  computed  by  SWAG  code[3].  The 
plots  show  the  characteristics  of  slowly  varying  magnitude  and  phase  of  the  2-D  scattered  field  for 
the  whole  elevation  angles.  Since  the  2-D  scattered  field  from  an  electrically  small  discontinuity  is 
not  sensitive  to  the  elevation  angles,  one  needs  to  compute  a  single  elevation  pattern  of  2-D  scattered 
field  only  once  for  the  whole  RCS  pattern  computation.  One  may  store  the  2-D  scattering  data  for 
discrete  elevation  angles  and  retrieve  the  stored  sample  data  using  a  simple  searching  program 
whenever  a  2-D  scattered  field  data  is  needed.  The  new  process  significantly  reduces  the 
computation  time  for  discontinuities  on  a  curved  surface.  A  typical  3-D  RCS  pattern  of  an  antenna 
installation  on  a  missile  body  takes  only  several  seconds  which  previously  took  an  hour  or  so. 

To  validate  the  new  UFIM  process,  a  test  model  with  a  groove  on  a  curved  surface(total  length  = 
95cm,  diameter  =  8cm)  were  constructed  as  shown  in  Figure  3.  The  test  model  consists  of  an  ogive, 
a  cylinder,  and  an  ogive  with  a  groove  (width  =  0. 15cm,  depth  -0.1 1cm)  between  the  front  ogive 
and  the  cylinder.  The  results  of  new  UFIM  prediction  were  compared  with  actual  3-D  RCS 
measurements  for  both  horizontal  and  vertical  polarizations  in  Figure  3.  All  comparisons  showed 
very  good  agreement  besides  some  ripples  in  the  measured  data  due  to  the  traveling  wave 
contributions.  The  body  scattering  from  the  ogive-cylinder-ogive  test  model  is  also  computed  using 
Physical  Theory  of  Diffraction(PTD)  technique  to  show  the  significance  of  the  scattering  from  the 
groove  even  though  its  cross  sectional  dimension  is  less  than  O.IX.  A  practical  antenna  installation 
problem  has  been  analyzed  using  the  new  UFIM  process.  A  groove  (width  =  0.15cm,  depth  = 

0. 1 1cm)  along  a  typical  antenna  boundary  (5cra  x  5cm  square  patch)  is  modeled  on  a  cylinder 
(diameter  =  15.25cm).  Figure  4  shows  a  square  shaped  antenna  mounted  on  a  curved  surface  and  its 
edge  is  perpendicular  to  the  nose-on  direction.  Figure  5  shows  the  same  antenna  mounted  on  a 
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curved  surface  but  its  edge  is  45  rotated  to  the  nose- on  direction.  Two  80  conical  RCS  patterns 
were  calculated  for  the  vertical  polarization  using  the  new  UFIM  process.  The  comparison  of  two 
RCS  patterns  in  Figures  4  and  5  clearly  shows  the  scattering  dependency  on  the  antenna  edge  shape. 
A  few  other  shapes  such  as  panels  and  doors  on  a  curved  surface  were  also  analyzed  using  the  new 
process.  The  efficiency  of  the  UFIM  code  has  been  significantly  improved  by  the  new  process  while 
the  new  process  maintains  the  same  accuracy  of  the  scattering  data.  Several  volumetric  RCS  patterns 
were  also  generated  for  antenna  installations  and  panels  on  a  curved  surface  using  the  new  process 
and  took  only  a  few  minutes  for  a  typical  volumetric  pattern.  The  new  UFIM  code  has  demonstrated 
its  full  capability  as  an  efficient  design  tool  for  the  practical  antenna/array  installations  or 
panels/doors  on  a  large  curved  scattering  object. 


CONCLUSIONS 

A  new  computer  algorithm  has  been  developed  to  enhance  the  capability  of  the  3-D  scattering 
code(UFIM)  for  antenna/array  installations  or  panels/doors  on  an  electrically  large  curved  surface. 
The  algorithm  significantly  reduces  the  computation  time  of  the  UFIM  while  maintains  the  same 
accuracy  of  the  prediction.  Several  test  models  were  made  and  the  UFIM  predictions  were  compared 
with  3-D  measurements.  All  test  results  showed  very  good  agreement.  This  paper  demonstrated  its 
enhanced  capability  as  an  efficient  design  tool  for  the  practical  antenna/array  installations  or 
panels/doors  on  a  large  curved  scattering  object.  Another  feature  is  that  one  can  readily  isolate  the 
RCS  contribution  from  electrically  small  discontinuities  on  a  practical  vehicle  using  the  UFIM 
process.  The  main  advantages  of  the  new  process  are  simplicity,  computation  efficiency,  and  its  vast 
application  to  many  complex  EM  scattering  problems  which  may  be  occurred  in  antenna/arxay 
installations  or  panels/doors  on  a  large  scattering  object 


REFERENCES 

[1]  Jacob  J.  Kim  and  Oren  B.  Kesler,  “Efficient  Computational  Technique  for  Backscattering  from  a 
Discontinuity  along  a  Piecewise  Continuous  Curve  on  a  Planar  Surface,”  in  Proc.  10th  Annual 
Review  of  Progress  in  Applied  Computational  Electromagnetics,  Monterey,  CA,  pp.  465-472,  Mar. 
1994. 

[2]  Jacob  J.  Kim,  Thomas  S.  Watson,  and  Oren  B.  Kesler,  “Efficient  RCS  Measurement  Technique 
of  a  Complex  Discontinuity  on  a  Large  Scattering  Object,”  in  Proc.  Sixteenth  Annual  Antenna 
Measurement  Techniques  Association  Symposium,  Long  Beach,  California,  pp.  70-75,  Oct.  1994. 

[3]  S.C.  Gibson,  B.C.  Graham-Jones,  B.L.  Homing,  M.M.  Jones,  O.G.  Romero,  and  C.W.  Yang, 
“Scattering  from  Generic  Surface  Discontinuities,  Part  VI:  User  Manuals,”  Northrop  Corporation 
Aircraft  Division,  December  1989, 


23 


3D  CANONICAL  PROBLEM 


STRAIGHT 

GROVE 


2D  CANONICAL  PROBLEM 


UFIM  FEATURES 

•  Localized  perturbation  currents  from  discontinuity 

•  Utilizes  2D  full  wave  analysis  of  discontinuity 

•  Efficient  contour  integration 

•  Surface  is  piecewise  locally  smooth 


•  Contour  radius  of  curvature  is  electrically  large  normal  incidence 


•  e.g.  SWAG  code 


Fig.  1.  UFIM  analysis  process  on  a  curved  surface. 
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Fig.  5.  A  80  conical  RCS  pattern  from  antenna  installations  of  a  45  ”  rotated  patch  at  18  GHz. 
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Abstract 


An  infrared  (IR)  measurement  technique  is  presented  which  has  been  developed 
to  measure  electromagnetic  (EM)  fields.  This  technique  uses  the  thermal 
absorption  properties  of  a  thin,  planar,  and  lossy  IR  detection  screen  to  map  the 
Intensity  of  the  EM  energy  over  any  two-dimensional  region  of  interest.  This 
measurement  produces  a  two-dimensional  IR  thermogram  of  the  electric  or 
magnetic  field  being  measured,  i.e.  an  isothermal  contour  map  of  the  intensity 
of  the  EM  field.  As  an  example,  I R  thermograms  are  presented  for  the  diffraction 
pattern  of  the  EM  field  scattered  from  an  F16  scale  model  aircraft  for  various 
microwave  frequencies,  angles  of  incidence,  and  polarizations  of  the  incident 
wave  and  for  several  different  orientations  of  the  aircraft  relative  to  the  incoming 
radiation.  The  experimentally  determined  IR  thermogram  for  the  scattered 
electric  field  for  nose-on,  horizontally-polarized  (plane  of  the  aircraft  wings) 
illumination  at  3  GHz  is  compared  to  the  scattered  electric  field  computed  for  the 
same  case  by  the  GEMACS  numerical  electromagnetics  code.  The  agreement 
between  the  techniques  is  discussed  and  used  to  validate  the  results  of  the 
code. 


1.  introduction 

A  non-destructive,  minimally  perturbing  infrared  (IR)  measurement  technique  has  been 
developed  to  observe  electromagnetic  (EM)  fields.  Metallic  surface  currents  and  charges  also 
can  be  measured  with  this  technique. 

This  IR  measurement  technique  produces  a  two-dimensional  IR  thermogram  of  the 
electric  or  magnetic  field  being  measured,  i.e.  an  isothermal  contour  map  of  the  intensity  of  the 
EM  field. 
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This  IR  measurement  technique  recently  has  been  applied  to  determine  microwave 
induced  surface  currents[1],  radiated  fields  from  antennas  and  high  power  microwave  (HPM) 
sources,  e.g.  near-field  intensities  and  far-field  antenna  patterns  of  horns[2-3],  diffraction 
patterns  of  EM  fields  scattered  from  complicated  metallic  objects[4-7],  intensities  of  EM  fields 
coupled  through  apertures  in  shielded  enclosures[8-11],  and  modal  distributions  of  EM  fields 
excited  inside  cavities[1 2-15]. 

Examples  of  EM  energy  scattered  from  an  FI  6  scale  model  aircraft  are  presented  in  this 
paper.  The  results  are  compared  to  data  predicted  numerically  by  the  GEMACS  numerical 
electromagnetics  code.  The  advantages  and  disadvantages  of  this  new  IR  measurement 
technique  are  also  discussed. 


2.  IR  Measurement  Technique 

The  IR  measurement  technique  is  based  on  the  Joule  heating  that  occurs  in  a  lossy 
material  as  an  EM  wave  passes  through  the  material.  A  thin  planar  sheet  of  a  lossy  carbon 
loaded  material  is  used  to  map  electric  fields;  a  thin  planar  sheet  of  a  lossy  ferrite  loaded 
material  is  used  to  map  magnetic  fields.  The  absorbed  heat  energy  is  converted  into 
conducted  and  convected  heat  energy  and  into  re-radiated  EM  energy.  The  radiated  EM 
energy  is  concentrated  in  the  IR  band,  and  is  detected  with  an  IR  Scanning  Array  or  an  IR 
Focal  Plane  Array  (FPA)  as  the  energy  radiated  from  a  "blackbody". 

2.1.  IR  Experimental  Setup 

This  technique  involves  placing  the  lossy  IR  detection  screen  in  the  EM  wave  in  the 
plane  over  which  the  fields  are  to  be  measured.  The  loss  mechanisms  in  the  detector  material, 
i.e.  the  conductivity  and  the  imaginary  components  of  the  permittivity  and  the  permeability  of 
the  detector  material,  absorb  the  incident  EM  energy  as  thermal  heat  energy  and  cause  the 
temperature  of  the  detector  to  rise  above  the  ambient  temperature  of  the  surrounding 
environment  by  an  amount  that  is  proportional  to  the  local  electric  and/or  magnetic  field 
intensity  (energy)  at  each  point  in  the  screen  material.  This  produces  a  two-dimensional  IR 
thermogram  of  the  electric  and/or  magnetic  field  in  the  screen  material. 

2.2.  IR  Detector  Screen 

The  screen  material  is  tailored  to  be  sensitive  to  only  one  component  of  the  field,  e.g. 
by  optimizing  the  values  of  the  electrical  conductivity  and  the  imaginary  part  of  the  permittivity 
of  the  material  relative  to  the  imaginary  part  of  the  permeability  of  the  material,  the  detector 
screen  can  be  made  sensitive  to  only  the  tangential  component  of  the  electric  or  magnetic  field 
in  the  plane  of  the  screen.  Care  is  also  exercised  In  the  selection  of  the  screen  material  not 
to  significantly  perturb  the  electric  or  magnetic  field  by  the  presence  of  the  lossy  material.  The 
screen  is  designed  to  absorb  less  than  5%  of  the  incident  power.  The  constitutive  parameters 
of  the  IR  detector  screen  are  optimized  to  produce  a  large  temperature  rise  in  the  detector 
material  for  a  small  amount  of  absorbed  energy. 
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The  temperature  difference  between  the  screen  material  and  the  background  is 
detected,  digitized,  and  stored  in  the  memory  of  the  IR  camera  on  a  pixel  by  pixel  basis.  The 
system  has  256  by  256  pixels  per  frame  of  data.  This  stored  data  represents  the  temperature 
distribution  over  the  extent  of  the  detector  screen  and  is  a  map  of  the  intensity  of  the  electric 
or  magnetic  field  distribution  absorbed  in  the  screen.  The  Incident  electric  or  magnetic  field 
can  then  be  determined  from  a  solution  of  Maxwell’s  Equations  (Fresnel’s  and  Snell’s  laws) 
for  an  EM  wave  incident  on  a  planar  film  of  lossy  material.  For  small  temperature  rises,  the 
electric  or  magnetic  field  intensity  is  nearly  linearly  proportional  to  the  temperature  change. 


2.3.  IR  Advantages  and  Disadvantages 


The  IR  measurement  technique  provides  a  quick  and  accurate  method  to  observe  EM 
fields  in  a  two-dimensional  plane.  However,  only  the  magnitude  of  the  electric  or  magnetic 
field  is  measured;  no  phase  information  is  detected.  Also,  since  this  technique  is  based  on  the 
thermal  mass  of  a  detector  material,  high  energy  is  required  to  produce  good  thermal  images 
of  EM  fields. 


3.  Scattering  and  Diffraction  Patterns 


As  an  example  of  the  IR  measurement  technique,  a  1 :32  scale  model  of  an  F1 6  aircraft 
was  constructed.  A  plastic  model  of  an  F16  was  sprayed  with  several  coats  of  silver  paint  to 
make  its  surface  conductive.  The  model  was  irradiated  In  the  far  field  of  a  microwave  horn 
antenna,  as  shown  in  Figure  1.  IR  thermograms  were  made  of  the  diffraction  pattern  of  the 


In  the  experiment,  a  large  IR  detector  screen  of  carbon  paper  was  positioned  around 
the  aircraft  in  several  different  orientations.  For  one  series  of  IR  thermograms,  the  screen  was 
positioned  in  the  axial  plane  through  the  middle  of  the  aircraft.  First,  the  screen  was  oriented 
in  the  plane  passing  through  the  wings  of  the  aircraft  to  map  the  scattered  fields  tangent  to  the 
plane  formed  by  the  wings  and  the  fuselage  of  the  aircraft.  Then,  the  screen  was  oriented  in 
the  plane  passing  through  the  tail  stabilizer  to  map  the  scattered  fields  tangent  to  the  plane 
formed  by  the  tail  stabilizer  and  the  fuselage  of  the  aircraft.  For  another  series  of  IR 
thermograms,  the  screen  was  oriented  in  the  transverse  plane  that  intersected  the  middle  of 
the  wings  of  the  aircraft,  cf.  Figure  1 ,  to  map  the  scattered  fields  In  the  cross  section  of  the 
aircraft. 

3.1.  (R  Measurements 

Experimentally  obtained  IR  thermograms  of  the  scattered  fields  for  the  case  of  nose-on 
illumination  are  shown  in  Figure  2.  The  series  of  thermograms  cover  the  2  to  6  GHz  range  of 
frequencies.  In  each  IR  thermogram,  the  follow  diffraction  phenomena  are  discernable: 

i)  a  standing  wave  between  the  source  and  the  nose  of  the  aircraft  due  to  the 
scattered  wave  off  the  front  of  the  aircraft 

ii)  a  surface  wave  on  each  side  of  the  aircraft  between  the  nose  and  the  missile 
rails  on  the  wing  tips 

iii)  a  point  of  diffraction  on  the  aft  end  of  each  wing  tip 

iv)  a  shadow  zone  behind  the  aircraft 

Many  configurations  of  the  aircraft  were  tested  at  various  other  frequencies, 
polarizations,  and  angles  of  incidence.  The  IR  thermograms  shown  in  Figure  2  are  typical  of 
the  results. 

3.2.  GEMACS  Predictions 

The  test  case  of  horizontally-polarized,  nose-on  illumination  in  the  plane  of  the  wings 
of  the  aircraft  at  3  GHz  was  chosen  to  be  modeled  by  the  GEMACS  code.  The  test 
configuration  is  shown  in  Figure  3.  The  electric  field  distribution  as  determined  experimentally 
by  the  IR  imaging  technique  and  as  determined  numerically  by  the  GEMACS  code  are  both 
shown  in  Figure  3.  The  surface  current  which  created  the  diffracted  fields  are  also  shown  in 
Figure  3.  The  corresponding  IR  Imaging  data  for  the  magnetic  field  was  not  determined 
experimentally. 

After  examining  Figure  3,  it  can  be  noted  that  many  features  of  IR  image  and  the 
GEMACS  data  are  in  agreement,  such  as  the  prominent  standing  wave  at  the  front  of  the 
aircraft,  the  surface  waves  along  both  sides  of  the  aircraft,  and  the  diffraction  centers  behind 
the  wing  tips.  Overall,  the  comparison  is  good. 

However,  there  are  some  areas  of  disagreement.  The  most  obvious  is  the  intense  zone 
of  scattering  behind  the  aircraft  in  the  GEMACS  data,  as  compared  to  the  shadow  zone  in  the 
IR  image.  Some  of  these  differences  can  be  explained  based  on  differences  in  the  IR  and 
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Figure  2.  IR  Thermograms  (at  2,3,4  and  6  GHz) 

Nose-on  lllunnination,  Horizontal  Polarization  (plane  of  the  aircraft  wings) 
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GEMACS  models.  The  IR  model  of  the  F16  has  missile  rails;  the  GEMACS  model  does  not, 
due  to  excessive  computational  time  required  to  model  this  detail.  Also,  in  the  GEMACS 
model,  the  incident  wave  is  truly  a  plane  wave;  whereas,  in  the  IR  model,  the  aircraft  is  being 
illuminated  by  a  spherical  wave. 

An  updated  model  of  the  F16,  with  missile  rails  installed  and  illuminated  by  a  spherical 
point  source,  is  being  prepared  for  the  GEMACS  code,  since  the  IR  thermograms  must  be 
taken  in  the  near  field  of  the  source,  due  to  sufficiently  high  incident  energy  requirements  on 
the  model. 


4.  Conclusions 

The  IR  measurement  technique  Is  a  viable  method  to  aid  In  the  determination  of  EM 
fields  scattered  from  complex  metallic  objects  such  as  advanced  aircraft  models.  The  IR 
method  allows  for  rapid  observation  of  EM  field  activity  and  interference,  resulting  In  an  in- 
depth  understanding  of  EM  scattering  phenomena.  The  technique  can  be  used  to  verify  the 
accuracy  of  EM  codes  by  comparing  the  IR  thermograms  with  the  computed  results. 
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Abstract  The  reflection  and  transmission  of  nonsinusoidal  plane  TEM-waves 
which  are  perpendicular  incident  on  a  planar  interface  between  a  perfect  dielectric 
and  a  lossy  semi-infinite  medium  is  analytically  investigated  in  the  time  domain. 

In  addition  the  reflected  and  transmitted  waves  are  calculated  for  incident  impulse 
waves.  Due  to  the  losses,  continuous  terms  evolve.  The  distribution  in  reflected 
and  transmitted  parts  and  the  propagation  of  the  impulse- shaped  parts  can  easily 
be  described. 

1  Introduction 

The  solution  of  Maxwell’s  equations  in  the  time  domain  especially  with  the  simple  and  powerful 
finite  difference  time  domain  (FDTD)  algorithm  [1]  is  being  used  very  successfully  for  many  years. 
Nevertheless,  the  physics  of  time  domain  behavior  is  not  understood  completely.  Perhaps  this 
is  due  to  education  methods,  which  are  focussed  rather  on  frequency-  than  on  more  complicated 
time  domain  methods. 

In  the  present  work  a  new  method  operating  directly  in  the  time  domain  for  solving  scattering 
problems  with  conducting  media  is  presented.  Let  us  assume  the  reflection  and  transmission 
problem  at  a  planar  boundary  between  a  perfect  dielectric  and  a  conducting  half-space,  shown 
in  Fig.  1.  The  known  perpendicular  incident  uniform  plane  wave  with  an  arbitrary  wave  form 
function  £i(G  can  be  expressed  in  the  time  domain  as 

=  —  e^xEi(z,t) 

with  Ui  =  l/-\/£i^i ,  Zi  =  \ffiil£i.  (1) 

If  medium  2  would  also  be  lossless  (^<72  —  0),  we  could  make  the  time  domain  ansatz 
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Figure  1:  Reflection  and  transmission  problem 

Et{zyt)  ~  d- =  ~e^xEt{z,t) 

\  V2/  ^2 

with  V2  =  Z2  =  \J^2lC2  (2) 

for  the  reflected  and  the  transmitted  wave  with  unknown  constants  r  and  d,  which  can  be  de¬ 
termined  by  enforcing  the  boundary  conditions  at  the  interface  2  =  0.  This  leads  to  the  known 
Fresnel  coefficients 

Z2  —  Zi  j  2Z2 

“  ^2  +  ~  Z2  +  Z^'  ^  ^ 

This  proceeding  is  very  similar  to  the  frequency  domain  method  for  monochromatic  waves, 
where  =  cos(a;^)  or  £1(0  =  exp(ja;^) . 

For  cr2  >  0  things  change  dramatically.  The  main  differences  between  the  lossless  and  the  lossy 
case  are,  that  in  the  lossy  case 

•  the  wave  form  function  (WFF)  of  the  reflected  wave  is  different  from  the  WFF  of  the  incident 
wave. 

•  the  electric  and  the  magnetic  field  in  the  lossy  medium  cannot  be  described  with  a  simple 
WFF  and  there  is  no  connection  between  the  electric  and  magnetic  field  like  Ht(2,  t)  = 

X  Et(2,  t)  in  the  lossy  medium. 

•  we  have  to  introduce  unknown  functionSj  not  constant  coefficients.  The  unknown  functions 
will  be  determined  also  by  the  boundary  conditions. 

Now  we  try  to  solve  the  problem  given  in  Fig.  1  with  medium  2  of  finite  conductivity  (172  >  0). 
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2  Ansatz  functions  in  lossy  media 


In  the  lossless  medium  1  we  assume  again  the  incident  wave 

Ei(z.t)  =  £o6(i-^)ex,  =  i-exXEi(2,t)  (4) 

with  £i(^)  =  0  for  ^  <  0 .  Due  to  the  losses  in  medium  2,  the  reflected  wave  has  to  be  expressed 
with  an  unknown  WFF  ^(0  * 


Hr(^)0  ”  7  ezXEx{Zit). 


As  ansatz  for  the  transmitted  wave  in  the  lossy  medium  2  we  write 
Et(2,t)  =  Eo^+(— .-)ex,  Ht(2,t)  = 

with  the  positive  fundamental  solutions  published  by  us  in  [2]: 


^=0  ''  ^ 


with  =  y  (i'  +  z*){V  —  z*  —  () 

and  the  normalized  space  and  time  variables 


— ,  =  I 


with  the  double  relaxation  time  r  =  2^21^2  of  the  lossy  medium  2.  The  real  (!)  constants  V\f2 
and  Zi/2  are  defined  by  (1)  and  (2).  In  this  work  we  use  the  convention  that  all  integrals  are  zero 
when  the  upper  limit  is  less  than  the  lower  limit.  This  saves  us  the  use  of  many  unit  step  functions 
behind  the  integrals. 

These  fundamental  solutions  (7)  and  (8)  for  the  electric  and  the  magnetic  field  in  a  conducting 
medium  are  obtained  by  solving  a  characteristic^  initial  value  problem  (IVP)  with  Riemann  s 
method  [3]  directly  in  the  time  domain  and  can  be  seen  as  expansion  of  d’Alembert’s  (time 
domain)  solution  of  the  wave  equation  for  conducting  media.  Both  electric  and  magnetic  field 
depend  only  on  the  characteristic  WFF  F+(^)  • 

Many  authors  before  us  solved  Cauchy  IVP’s,  e.  g.  [4],  [6]^  [ll]^  and  [12].  For  the  use  as  ansatz 
functions  like  in  the  problem  shown  in  Fig.  1  the  fundamental  time  domain  solutions  are  better 
suited  than  solutions  of  Cauchy  problems,  because  at  the  interface  between  two  media  there  are  no 


^The  initial  values  are  given  on  the  charakteristic  directions  -f  z’  =  0  and  i'  -  z'  ■=■  0. 
^very  controversially  discussed  by  [6,  7,  8,  9,  10] 

^only  for  the  electric  field  strength 
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impressed  field  strengths  like  they  are  assumed  for  the  solution  with  Cauchy  initial  data.  The  field 
strengths  on  both  sides  of  the  interface  rather  adjust  themselves  due  to  the  boundary  conditions 
at  the  interface  which  will  be  taken  into  account  now.  Like  in  the  lossless  case,  where  we  have  to 
determine  the  constants  r  and  d,  we  now  look  for  some  expressions  for  the  determination  of  the 
unknown  characteristic  wave  form  function  and  the  unknown  WFF  Sr  of  the  reflected  wave. 


3  Enforcing  boundary  conditions 

Enforcing  continuity  on  the  tangential  components  of  the  electric  and  magnetic  fields  (4)-(6) 
across  the  interface 

and  the  introduction  of  improve  the  numerical  stability  finally  yields 

dSiirt')  =  -  i  /e-*'[rIo(i^)-t'M^]ef:F+(Od^  (9) 

dUrt')  =  rE+(i')  -  i  /  e-^' [lcK)-rt'^i^le^E+(Od|  (10) 

e=o  ^  ° 

with  —  0 . 

Contrary  to  all  expectations  the  Fresnel  coefficients  r  and  d  according  to  (3)  for  a  nonconducting 
medium  2  can  be  used  to  simplify  these  expressions. 

Equation  (9)  is  a  Volterra  integral  equation  of  the  second  kind  for  which  can  be  solved 

with  little  numerical  effort.  The  computation  of  St{z\t*)  y  7{t{z\i*)  and  follows  straight¬ 

forward. 


Without  solving  these  equations  we  can  conclude: 

•  For  Zi  <C  -^2)  L  e.  r  =  1,  d  =  2,  follows  Sr{t)  =  £l(t) ;  the  reflected  wave  has  the  same  WFF 
as  the  incident  wave. 


•  For  Zi  Z2i  i.e.  r  =  —1,  d  — >  0,  follows  Sr{t)  =  — £i(t);  the  reflected  wave  has  the  same 
WFF  as  the  incident  wave  up  to  the  factor  —1. 

•  For  i  =  0  follows  at  the  interface  z  =  0: 

E,(0,0)  =  rEoSi{0)e,  -  rEi(0,0),  (11) 

Et(0,0)  =  dEo£’i(0)e,  =  dEi(0,0),  (12) 

according  to  (9)  for  t'  =  0:  F{0)  =  d£'i(0)  =  d/r£^(0).  From  (11)  and  (12)  we  now  con¬ 
clude  that  the  distribution  of  the  wave  front  of  the  incident  wave  to  the  reflected  and  the 
transmitted  wave  front  is  independent  of  the  losses  in  medium  2.  This  will  last  even  for  an 
incident  wave  with  impulse-shaped  wave  form  as  it  will  be  shown  in  the  next  paragraph. 

Note  that  all  these  conclusions  are  independent  of  the  conductivity  era. 


4  Incident  impulse  wave 

Now  we  want  to  investigate  an  incident  wave  with  Dirac-impulse  shaped  WFF: 


The  modified  characteristic  WFF  F  consists  now  of  two  parts  —  a  Dirac  impulse  and  a  continuous 
part.  With 


(13) 


into  (9)  we  get  a  new  Volterra  integral  equation  of  the  second  kind  for  the  determination  of  the 
continuous  part  To  of  F : 


l-i' 

2 


t'  ^ 

[rlo(0-li(i')]  =  ToW-i  /  e-*' 


{=0 


r  loC^^b)  t 


(14) 


with  Wb  =  ~  0  • 


With  the  solution  ro(0  (^4),  the  normalized  fields  in  the  lossy  medium  2  follow  from  (7)  and 
(8)  with  (13) : 


=  d^e  ^  5(t' -  F)  e  ^  To{t' —  z')  - -e  ^ 


(15) 


1 7  [io(^.)  -  (t'  +  .')^1  ro(0  dA , 

24  L  J  J 


f)  =  d{  e-^  S{t'  -  z')  +  e-^  To{t'  -z')  +  -  e' 


lo(i')  +  {i'  +  z')  - 


(16) 


77[iok)+(4'+^')^ 


+ 


e=o 


m)^( 


with  !£;+  =  yj{t'  +  —  z*  —  ^) . 


Likewise  with  the  known  solution  ro(^)  of  (14)  the  WFF  of  the  reflected  field  with  the  argument 
x  r=  -|-  2'  follows  from  (10)  and  (13)  : 


5(x)  +  ro(x)  +  ie-==Ii(x)  +  ix  y 

X 

[0(2)+  J  e^^lo{w2)d( 


a-xIlMd^ 

W2 


(17) 


e-^l 


£=0 


with  W2  =  \fx{x  —  ^) . 
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Z\lZ2-=\  =>  7*  =  0,  d  —  1 


Z^IZ2  =  2  7- =  -1/3,  (i  =  2/3 


Figure  2:  Impulse  response  of  normalized  electric  (above)  and  magnetic  (below)  fields  for 
t'  =  0,  0.5,  1.0, 5.0  and  two  different  values  of  Z1/Z2 


In  Fig,  2  and  3  the  time  evolution  of  the  normalized  solutions  for  an  incident  Dirac  impulse  wave 
and  different  (always  real!)  values  of  ZxlZ2  is  shown.  The  incident  impulse  is  omitted  in  the 
plots.  For  z  <  0  the  space  variable  z‘  is  normalized  to  v^t  and  for  z  >  0  to  V2'r  ■  The  normalized 
magnetic  field  Tirfi  of  the  reflected  wave  is  defined  by 

t')  =  J  [r  (i'  +  z')]  =-Y^Sr.6  [r{t'  +  z')]  . 

It  is  an  unexpected  result  that  the  Dirac  impulses  are  divided  due  to  the  reflection  coefficient 
r  and  the  transmission  coefficient  d  like  for  the  lossless  case,  which  can  be  seen  directly  from 
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Figure  3;  Impulse  response  of  normalized  electric  (above)  and  magnetic  (below)  fields  for 
t'  =  0,  0.5,  1.0, 5.0  and  two  different  values  of  Z1IZ2 

(15)-(17).  We  conclude  that  the  incident  impulse  does  not  see  the  losses  in  medium  2  when  it  is 
reflected  for  =  0.  After  the  impulse  has  passed  the  interface  it  is  exponentially  damped  due  to 
the  losses  and  causes  continuous  waves^  which  are  propagating  also  into  the  lossless  medium  1. 
This  happens  also  in  the  case  Zi  =  Z2  (Fig.  2  right),  which  can  be  concluded  from  (17).  For  r  =  0 
the  wave  front  of  the  only  existing  continuous  part  of  the  normalized  reflected  wave  has  always 
the  amplitude  1/2,  independent  of  the  conductivity  cr,  which  only  has  to  be  greater  than  zero. 

Problems  with  conducting  layers  can  be  solved  with  this  method  in  the  same  way,  when  we 
take  into  account  the  “negative”  fundamental  solution  additional  to  . 
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Abstract 

Ongoing  work  at  Cray  Research  is  involved  in  the  evaluation  and  classification  of  codes 
and  techniques  used  in  the  numerical  solution  of  problems  in  electromagnetics.  Different 
formulations  typically  lend  themselves  to  certain  types  of  problems,  and  a  number  of  codes 
have  sprung  up  to  address  the  specialities.  Part  of  the  evaluation  effort  involves  providing 
feedback  to  code  developers,  whenever  possible,  on  algorithms  and  coding  structures  that  will 
allow  larger  problems  to  be  dealt  with  effectively  and  efficiently. 

It  is  the  purpose  of  this  note  to  report  on  certain  experiences  at  Cray  Research  with  the 
electromagnetic  scattering  code,  tentatively  named  JUNCTION-2,  under  development  at  the 
University  of  Houston  by  D.  R.  Wilton  and  his  team.  The  structure  of  the  code,  ^  well  as 
the  class  of  problems  it  addresses,  makes  it  a  good  candidate  for  a  new  class  of  iterative  linear 
system  solvers.  Examples  are  given  for  a  problem  arising  in  the  EMI/EMC  andysis  of  a  leaky 
equipment  panel  in  a  shielded  room.  The  results  will  be  presented  at  the  meeting  and  are  not 
included  here. 


1  Introduction 

Numerical  solution  techniques  for  problems 
in  electromagnetics  are  nearly  as  diverse  as  the 
problems  themselves.  Problems  in  antenna  de¬ 
sign,  EMI/EMC,  medical  imaging,  circuit  board 
prototyping,  motor  design,  plasma  evolution, 
etc.,  all  base  their  formulations  on  Maxwell’s 
equations.  Yet,  the  success  of  computer  simula¬ 
tion  of  the  fields  and  processes  of  interest  lies  in 
the  ability  of  the  formulation  (i)  to  adapt  itself 


accurately  to  the  geometry,  materials,  and  un¬ 
knowns  of  the  problem,  and  (ii)  to  be  amenable 
to  solution  by  approximate  techniques. 

For  example,  thin- wire  antenna  design  prob¬ 
lems  are  well-suited  to  classical  wire-based  in¬ 
tegral  equation  moment  methods;  RCS  calcu¬ 
lations  for  conducting  bodies  are  often  times 
naturally  handled  by  patch-based  integral  equa¬ 
tion  methods.  Thus,  these  problems  have  rather 


•crowQcray.com,  qsQcray.com,  Cray  Research,  Cray  Research  Park,  655E  Lone  Oak  Drive,  Eagan,  MN  55121 
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nice  formulations,  and  further,  benefit  from  the 
fact  that  the  theory  of  solution  approximation 
for  such  problems  is  rather  mature.  This  com¬ 
bination  leads  to  successful  implementations  as 
codes. 

The  present  study  arises  from  ongoing  work 
at  Cray  Research  to  evaluate  and  classify  high- 
quality  codes  and  techniques  of  interest  to  engi¬ 
neers  and  analysts  in  the  electromagnetics  com¬ 
munity.  The  goal  is  to  understand  better  the 
tools  available  to  such  workers,  and  to  provide 
feedback  and  assistance  on  techniques  and  algo¬ 
rithms  when  requested.  The  survey  to  date  has 
included,  in  addition  to  the  present  code,  low- 
frequency  finite  element  codes  used  in  the  de¬ 
sign  of  motors  and  generators,  and  the  scatter¬ 
ing  code  CARLOS-3D.  We  would  like  to  thank 
Don  Wilton  and  William  Brown  of  the  Univer¬ 
sity  of  Houston  for  their  assistance  in  this  work. 

2  The  JUNCTION-2  Code 

The  code,  tentatively  referred  to  as 
JUNCTION-2  by  D.  R.  Wilton,  is  under  de¬ 
velopment  at  the  University  of  Houston,  in  con¬ 
junction  with  Lawrence  Livermore  and  Sandia 
National  Laboratories,  and  the  US  Navy.  It 
is  a  method-of-moments  code  based  upon  the 
classical  Stratton-Chu  integral  formulation  of 
scattering  problems.  As  such,  it  allows  the 
user  to  formulate  problems  in  terms  of  electric 
field,  magnetic  field,  and  combined  field  integral 
equations.  [1] 

A  rather  important  aspect  of  JUNCTION-2 
is  its  ability  to  handle  structures  consisting  of 
both  body-type  scatterers  as  well  as  wire  scat- 
terers.  Thus  scattering  from  structures  and  cou¬ 
pling  onto  wire  antennas  can  be  handled  reliably. 
This  has  application  in  antenna  design  and  an¬ 
tenna  placement  problems  in  which  it  is  impor¬ 
tant  to  understand  the  effects  of  a  conducting 
environment  on  antenna  performance.  Such  an¬ 
tennas  include  long- wire  antennas  and  dipole  ar¬ 
rays  as  found  in  direction-finding  equipment  on 
board  naval  ships.  Also,  one  can  consider  cou¬ 
pling  to  cables  and  radiation  from  them. 


Element  Types  Models  of  the  scattering 
structures  are  assumed  to  be  made  up  of  flat 
triangular  patches  and  straight  wire  elements. 
The  wire  elements  may  be  joined  to  the  patches 
at  endpoint- vertex  junctions.  Impedance  load¬ 
ing  of  the  patches  is  accomodated. 

Basis  Functions  Surface  currents  are  piece- 
wise  linear  functions,  and  represented  in 
terms  of  the  Rao-Wilton-Glisson  basis  functions 
[2]  which  guarantee  that  current  flowing  into  an 
edge  is  balanced  by  the  current  flowing  out  of 
the  edge.  Possible  future  work  on  these  basis 
functions  would  address  the  physical  field  singu¬ 
larity  occurring  at  the  edges  of  nonsmooth  patch 
boundaries.  On  wire  elements,  piecewise  linear 
(chapeaux)  functions  are  used. 

Excitations  Excitation  of  structures  can  be 
in  the  form  of  plane  waves  or  voltage  sources. 
However,  the  modularity  of  the  code  is  such  that 
is  is  rather  easy  to  introduce  other  excitation 
types.  As  part  of  this  evaluation,  we  wrote  a 
general  excitation  package  that  was  added  and 
used  to  compute  the  scattering  of  a  dipole  field 
within  a  shielded  equipment  room.  [See  next 
section.] 

Output  The  surface  current  densities  com¬ 
puted  by  code  are  provided  as  output.  These 
allow  subsequent  near-  and  far-field  estimations 
to  take  place.  In  addition,  provision  is  made  for 
far-field  patterns  to  be  obtained.  In  this  Ccise, 
the  far-field  position  of  interest  is  provided  by 
the  user  in  the  input  deck 

3  Examples 

A  number  of  test  cases  were  used  to  evaluate  the 
performance  of  the  code,  and  were  selected  in 
order  to  represent  the  computational  step  in  an 
EMI/EMC  analysis  and  an  RCS  analysis.  Both 
use  the  combined-field  integral  formulation  to 
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avoid  problems  with  internal  resonances.  Here, 
we  report  on  the  results  of  the  EMI  problem. 

The  EMI/EMC  problem  models  a  leaky 
equipment  panel  in  a  shielded  room.  The  room 
is  13  ft  by  6.5  ft,  and  8  ft  high.  Fig.  1  shows  a 
cutaway  view  of  the  mesh  used  for  the  calcula¬ 
tions.  Near  the  right  side  of  the  equipment  rack 
is  a  small,  300  MHz  dipole  radiator  that  sub¬ 
sequently  produces  fields  throughout  the  room. 
The  first  step  in  an  EMI/EMC  assessment  is  to 
determine  the  ambient  fields  in  the  room;  once 
this  is  done,  an  estimate  of  the  effect  of  the  field 
on  cabling,  circuit  boards,  etc.  is  performed. 
Fig.  2  shows  schematically  the  density  of  the 
moment  matrix  of  the  CFIE  for  this  example. 

To  parameterize  the  problem,  the  currents 
induced  in  the  room  due  to  different  orientations 
of  the  dipole  radiator  were  computed.  Specif¬ 
ically,  360  different  orientations  of  the  dipole 
were  of  interest,  and  corresponded  to  360  exci¬ 
tations  of  the  structure.  The  orientations  were 
chosen  so  that  the  dipole  was  rotated  (parallel  to 
the  nearby  wall  of  the  equipment  rack)  through 
360  deg  in  1  deg  steps.  Another  run  was  made 
in  which  the  radiator  was  translated  through  the 
room  while  its  orientation  remained  fixed.  Dif¬ 
ferent  levels  of  mesh  refinement  were  used. 

4  Iterative  Solvers 

There  has  been  a  significant  amount  of  atten¬ 
tion  drawn  to  iterative  solver  technology  for  lin¬ 
ear  systems  of  equations.  The  main  attraction 
of  course  is  that  it  has  been  observed  in  prac¬ 
tice  that  high-quality  solution  estimates  are  ob¬ 


tained  in  a  relatively  small  number  of  iterations, 
maybe  on  the  order  often  to  fifty,  and  this  makes 
the  solution  process  0{N'^)  in  practice.  Here,  we 
will  discuss  some  experience  with  a  commercial 
solver,  C DENSE,  developed  by  Elegant  Mathe¬ 
matics,  Inc.^  This  solver  is  available  for  Cray 
platforms.  Contact  the  vendor  for  further  infor¬ 
mation. 

CDENSEisa  robust,  fast,  iterative  linear  sys¬ 
tem  solver.  Is  is  applicable  to  the  linear  systems 
with  complex,  dense  coefficient  matrices  and 
multiple  right-hand  sides  originating  from  3d 
boundary  integral  formulated  applications  such 
as  computational  electromagnetics  and  acous¬ 
tics. 

It  uses  high-quality,  numerically  stable, 
sparse  ILU  preconditioning  strategies  for  dense 
coefficient  matrices  and  thus  provides  an  asymp¬ 
totically  0{n^)  iterative  solution  using  precondi¬ 
tioned  GMRES-like  iterations  without  restarts. 
Special  optimizing  features  are  available  to  im¬ 
prove  considerably  CDENSE’s  efficiency  when 
solving  linear  systems  having  a  large  number  of 
multiple  right-hand  sides.  Special  versions  can 
be  configured  to  handle  specific  problems. 

The  key  distinguishing  characteristics  of  this 
solver  are  its  high  reliability,  its  efficiency,  and 
its  ability  to  produce  solutions  with  a  user- 
prescribed  accuracy  for  highly  ill-conditioned 
problems. 

A  great  deal  of  success  has  been  had  with 
such  solvers,  and  from  a  practical  standpoint  to 
the  engineer  and  analyst,  the  utility  lies  in  their 
ability  to  provide  in  practice  solutions  in  0(n  ) 
time. 
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Quantitative  Methods  for  Measuring  and  Improving  the 
Performance  of  Electromagnetic  Scattering  Codes  i 

J.  P.  Meyers,  A.  L  Terzuoli,  Jr.,  G.  C.  Gerace  •Air  Force  Institute  of  Technology 

INTRODUCTION:  A  set  of  quantitatively  based  evaluation  methods  is  presented  as  an  alternative  to 
qualitatively  based  evaluation  methods.  These  quantitative  methods  provide  a  way  of  making 
objective  evaluations  of  a  code's  performance.  In  addition,  these  methods  can  be  used  to  analyze 
predictions  of  complicated  scattering  patterns. 

Validating  an  electromagnetic  code  involves  comparing  predicted  results  with  theoretical 
results  for  a  set  of  known  scattering  objects.  These  comparisons  confirm  or  validate  that  the  code  is 
accurately  predicting  the  electromagnetic  field.  Although  in  principle  these  comparisons  sound  simple, 
in  reality  it  is  a  rather  involved  problem.  Usually,  the  primary  method  in  use  for  comparing  predicted 
and  theoretical  results  involves  overlaying  the  plots  of  the  two  data  sets.  This  approach  allows  for  a 
qualitative  comparison  between  the  two  data  sets  and  clearly  identifies  gross  errors  in  the  predicted 
results.  However,  it  is  not  an  adequate  evaluation  procedure  for  detecting  subUe,  yet  significpt 
differences,  nor  does  it  work  well  when  trying  to  validate  the  fields  from  objects  with  rapidly  varying 
(e,  0;  patterns.  The  evaluation  procedures  presented  use  a  quantitative  approach  for  making 
comparisons.  The  quantitatively  based  comparisons  allow  for  an  unbiased,  objective  analysis  and  can 

be  applied  to  alUft^)  patterns.  ^  ^  •  a  -ru 

These  evaluation  procedures  should  be  applicable  to  a  wide  range  of  prediction  codes.  1  he 
procedures  focus  on  the  values  of  the  scattered  field  or  the  ( 9,  <l>)  pattern.  It  is  assumed  that  the 
scattered  field  can  be  obtained  from  the  output  of  the  specific  prediction  code  of  interest.  Since  the 
procedure  does  not  depend  upon  specific  features  of  any  single  prediction  code,  it  is  reasonable  to 
assume  that  they  will  perform  equally  well  for  any  prediction  code. 

Sources  of  reference  data  Sets:  This  section  addresses  the  issue  of  producing 
reference  data  sets  with  which  the  predicted  results  are  compared.  The  limited  source  of  exact 
analytical  solutions  is  presented  followed  by  a  discussion  of  using  measured  data.  In  the  ^ction  on 
using  measured  data,  the  issue  of  measurement  error  is  addressed.  Solutions  for  eliminating  or 
accounting  for  some  of  the  errors  are  presented. 

Exact  Analytical  Solutions:  The  evaluation  procedures  compare  predicted  results 
with  theoretical  data  sets.  Ideally  the  comparisons  would  be  made  using  scattering  objects  for  which 
an  exact  analytical  solution  exists.  However,  exact  analytical  solutions  are  known  for  few  objects. 
One  such  object  is  a  perfect  electric  conducting  (PEC)  sphere.  Other  objects  for  which  exact  or  near 
exact  solutions  exist  include:  flat  plates,  circular  cylinders,  and  semi-infinite  wedges.  Definitely 
absent  from  the  list  are  objects  with  many  scattering  centers  [1].  Since  so  few  exact  analytic  solutions 
exist,  comparisons  are  usually  made  with  measured  scattered  field  patterns. 

Measured  data:  A11  measured  data  is  subject  to  various  measurement  errors  ^d  system 
limitations  which  introduces  a  level  of  uncertainty  in  the  evaluation.  But  if  the  predictions  agree 
within  the  measurement  error,  the  code  can  be  considered  to  produce  valid  results.  Therefore,  it  is 
important  to  either  know  the  error  limits  of  the  measured  data  or  be  able  to  estimate  it. 

Random  error  (e.g.  noise)  is  not  the  only  source  of  error  that  must  be  addressed  during  the 
evaluation  procedure.  The  systematic  errors  inherent  in  the  measurement  process  pose  a  potentially 
more  serious  problem.  These  errors  can  include,  misalignment  of  the  scattering  object,  calibration 
errors,  and  various  approximations  and  assumptions  made  in  the  design  of  the  measurement  facility. 
Unfortunately  little,  if  anything,  can  be  done  to  reduce  the  effects  of  calibration  error.  If  significant 
calibration  errors  are  known  to  exist  in  the  measured  data,  the  best  solution  is  to  repeat  the 
measurements  after  re- calibrating  the  measurement  system. 


^  This  work  was  supported  by  the  U.  S.  Air  Force  Wright  Laboratories. 
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Other  common  systematic  errors  are  not  as  critical  as  calibration  errors.  It  is  well  known  [2,  3, 
4,  5]  that  misalignment  by  as  much  as  0.5°  in  either  azimuth  or  elevation  is  enough  to  produce 
significant  but  subtle  differences  especially  for  objects  with  many  scattering  centers.  Another  common 
source  of  systematic  errors  is  the  uniform  plane  wave  approximation  used  by  measurement  facilities. 
Ideally,  the  scattering  object  should  be  illuminated  with  a  uniform  plane  wave  (constant  amplitude  and 
phase),  but  this  would  require  an  infinite  aperture.  One  method  of  reducing  the  error  associated  with 
uniform  plane  wave  approximations  is  to  use  a  compact  range.  This  type  of  measurement  range 
creates  a  better  uniform  plane  wave  approximation  by  using  a  large  parabolic  reflector  to  convert  the 
incident  spherical  wave  into  a  near  uniform  plane  wave  over  a  finite  test  volume.  Uniform  plane  wave 
approximation  errors  tend  to  create  shifts  in  the  overall  magnitude  of  the  measured  scattered  field 
levels  .  Therefore,  it  may  be  better  to  compare  relative  levels  instead  of  absolute  levels. 

Sometimes  systematic  errors  can  be  addressed  during  the  comparison  process.  Scattered  field 
data  sets,  especially  azimuth  and  elevation  sweeps,  tend  to  be  characterized  by  large  peaks.  For 
azimuth  and  elevation  sweeps,  these  peaks  can  correspond  to  large  specular  reflections,  which  are 
characteristic  of  smooth  surfaces  with  large  radii  of  curvature,  but  can  also  be  created  by  other 
phenomena  as  well.  These  peaks  can  be  used  for  alignment  purposes  in  both  angular  coordinates  and 
field  magnitudes.  Often  approximate  calculations  can  be  used  to  estimate  the  actual  peak  value.  This 
value  is  then  used  to  shift  both  the  measured  and  predicted  values.  A  short  cut  that  often  works  equally 
well,  is  to  shift  either  one  of  the  two  data  sets  to  correspond  to  the  value  of  the  other  one.  In  both 
cases,  the  relative  changes  in  field  values  are  compared.  This  has  the  added  benefit  of  separating 
errors  in  overall  levels  from  errors  in  relative  changes  within  the  data  set. 

Error  Metrics:  Once  an  acceptable  reference  data  set  has  been  obtained,  it  is  necessary  to 
determine  exactly  how  the  two  data  sets  will  be  quantitatively  compared.  Each  data  set  consists  of  N 
x-y  data  points  which  correspond  to  the  scattered  field  of  the  test  object  as  a  function  of  either  azimuth, 
elevation,  or  frequency.  The  purpose  of  the  error  metric  is  to  accurately  describe  the  overall  agreement 
between  the  two  data  sets  with  as  few  numbers  as  possible.  Ideally,  the  level  of  agreement  can  be 
represented  with  a  single  numerical  value.  Several  error  metrics  that  measure  the  level  of  agreement 
between  two  data  sets  are  presented.  It  is  assumed  that  the  two  data  sets  have  the  same  number  of  data 
points  corresponding  to  the  same  values  for  the  independent  variable.  The  metrics  proposed  are  only 
one  set  of  possible  error  metrics.  Other  metrics  may  exist  that  work  equally  well  for  various 
circumstances.  The  best  approach  for  evaluating  potential  metrics  involves  observing  how  they 
respond  to  various  types  of  differences  between  reference  and  test  data.  For  each  of  the  metrics  in  this 
study,  the  following  definition  of  error  is  used: 

reference  predicted 

where  a  is  the  echo  area  (or  radar  cross  section,  RCS)  measured  in  either  linear  area  units  (m^)  or 
logarithmic  units  (dBsm).  It  is  important  to  note  that  logarithmic  units  tend  to  enhance  the  effects  of 
nulls  whereas  linear  units  tend  to  suppress  nulls.  Again,  experimentation  is  the  best  way  to  determine 
which  is  better  for  a  given  set  of  circumstances. 

Mean  Square  Error  (MSE):  The  mean  square  error  is  a  simple  way  of  measuring  the 
average  error  of  all  data  points  in  a  prediction.  This  measure  does  not  allow  positive  and  negative 
errors  to  cancel  each  other  since  the  error  values  are  squared  before  summation.  This  measure  is 
calculated  by 


MSE  =  -Yef 

ntt  (2) 

where  the  error  of  the  data  value,  et^  is  given  by  (1).  The  number  of  points  in  each  data  set  is  given 
by  n.  Small  values  for  MSE  indicate  good  overall  agreement  between  the  two  data  sets.  However,  a 
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large  MSE  does  not  necessarily  mean  poor  agreement.  Since  the  error  values  are  squared  before 
summation,  relatively  large  error  values  have  more  influence  on  the  value  of  the  mean  square  error. 

Mean  error  (ME):  Like  the  mean  square  error,  the  mean  error  measures  the  average 
error  of  all  data  points  in  a  prediction.  Unlike  the  mean  square  error,  the  mean  error  allows  positive 
and  negative  errors  to  cancel  each  other.  Due  to  this  cancellation,  a  relatively  poor  prediction  can 
result  in  a  low  error  value.  The  mean  error  is  calculated  as 


ME  =  -Ye: 

nti  . 


(3) 


A  low  value  for  ME  can  mean  one  of  two  things:  good  agreement  between  the  two  data  sets,  or  large 
errors  were  canceled  with  large  or  numerous  smaller  errors  of  the  opposite  sign. 

STANDARD  DEVIATION  (STD):  The  standard  deviation  measures  the  degree  of  variation 
about  the  mean.  Therefore,  it  is  a  good  error  measure  to  use  when  the  measured  data  is  relatively 
constant  and  the  predicted  data  deviate  from  that  value.  An  azimuth  scan  of  a  sphere  or  a  cylinder  is 
an  example  where  the  standard  deviation  may  be  a  good  measure  of  the  error.  Since  the  scattered  field 
of  a  sphere  and  cylinder  don't  change  with  aspect  angle,  the  scattered  field  should  be  constant. 
However,  since  many  prediction  codes  use  faceted  models  and  the  predicted  scattered  field  will  vary 
with  aspect  angle.  The  standard  deviation  is  calculated  by 


STD  =  .-{MSE-ME^) 
V  n  -1 ' 


(4) 


Percentage  Agreement  (PA):  since  most  predictions  are  compared  to  measured 
data,  it  is  necessary  to  account  for  random  measurement  error  (noise)  in  the  measurement  system.  This 
percent  agreement  metric  calculates  the  percentage  of  the  predicted  data  points  which  have  error  values 
less  than  the  measurement  error,  or  some  other  acceptable  level  of  error.  This  error  metric  is  flexible 
enough  to  account  for  measurement  errors  that  vary  as  a  function  of  the  independent  variable.  In 
addition,  if  an  acceptable  error  level  has  been  established,  that  error  level  can  be  used  to  determine  the 
percentage  agreement.  Percentage  agreement  is  calculated  by 


PA  = 


#(|  e\  <  err] 

- i-100 

n 


(5) 


where  err  is  the  acceptable  error  threshold  value  which  can  be  a  constant  or  a  function  of  the 
independent  variable.  The  pound  sign  (^)  in  the  numerator  is  used  to  indicate  the  total  number  of  error 
values  which  satisfy  the  expression  in  parentheses. 

Noise  floors:  in  addition  to  measurement  errors,  measured  data  contains  a  noise  floor 
which  is  the  level  of  the  background  environment  which  cannot  be  deleted  from  measured  data.  The 
noise  floor  depends  upon  the  conditions  under  which  the  measurements  were  made.  It  varies  from  one 
measurement  facility  to  the  next  and  can  even  depend  upon  the  scattering  object  being  measured. 
Measured  field  levels  close  to  the  noise  floor  are  indistinguishable  from  system  noise,  thus  are 
meaningless.  Therefore,  each  of  the  error  metrics  listed  can,  and  generally  should,  be  modified  to 
include  only  those  data  values  which  are  significantly  above  the  noise  floor. 

DATA  PROCESSING  AND  FILTERING:  For  complicated  objects,  the  back  scattered  field  with 
respect  to  azimuth  angle  tends  to  have  rapid  fluctuations  separated  by  only  a  fraction  of  a  degree,  with 
a  dynamic  range  of  20  to  40  dB  [6].  In  addition  to  these  rapid  fluctuations,  an  underlying  trend  exists 
based  on  the  geometry  and  composition  of  the  scattering  object.  Therefore,  filtering  of  the  data  such 
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that  the  underlying  trend  becomes  apparent  is  useful.  Several  methods  of  smoothing  scattered  field 
data  are  traditionally  used.  In  general,  they  involve  a  sliding  window,  in  which  the  center  data  point  is 
replaced  with  a  value  (e.g.  mean  and  median)  related  to  all  of  the  data  points  in  a  window  of  fixed  but 
arbitrary  width.  Since  scattered  field  data  is  commonly  plotted  in  logarithmic  (dBsm)  or  linear  (m^) 
units,  the  issue  arises  concerning  in  which  scale  the  data  is  best  smoothed.  However  taking  the  mean 
of  logarithmic  data  is  the  same  as  taking  the  geometric  mean  of  linear  data. 

None  of  the  traditional  window  smoothing  methods  attempt  to  take  into  account  the  geometry 
of  the  scattering  object.  The  spacing  of  the  peaks  and  nulls  in  the  scattered  field  data  corresponds  to 
how  rapidly  the  relative  scattering  centers  move  in  and  out  of  phase.  For  example,  in  the  case  of  a  long 
narrow  object,  the  peaks  and  nulls  will  be  spaced  more  closely  when  viewed  from  broadside  (90°)  than 
when  viewed  from  either  end  (0°  or  180°).  The  path  distances  from  the  transmitter  to  each  end  vary 
more  rapidly  at  broadside  than  they  do  at  other  incident  angles,  thus  creating  the  higher  density  of 
peaks  at  broadside. 

We  now  present  throe  filtering  techniques  which  account  for  these  variations  in  peak  to  null 
separation.  All  three  methods  use  the  sliding  window  concept,  however  the  width  of  the  window 
width  is  variable  over  the  entire  azimuth  of  the  scattered  field  data  set.  Initially,  two  window  widths 
are  calculated,  one  for  the  end  aspect  and  the  other  for  the  broadside  aspect.  These  two  window  widths 
are  based  on  a  bounding  box  which  circumscribes  the  scattering  object.  In  the  case  of  a  generic  long 
narrow  object,  this  is  a  simple  rectangular  box.  The  box  widths  at  the  two  azimuths  can  be  viewed  as  a 
rectangular  window  on  the  surface  currents  of  the  scattering  object.  In  going  from  the  induced  currents 
on  the  object  to  the  scattered  far  field,  a  Fourier  Transform  may  be  applied  to  these  windows  to  get  a 
corresponding  sine  i.e.  (sin  x)/x.  The  null-to-null  width  of  the  sine  main  lobe  is  used  as  the  window 
width.  Since  the  projection  of  the  scattering  object  on  the  transmitter  varies  as  a  function  of  azimuth, 
the  window  width  should  also  vary  as  a  function  of  azimuth.  This  variation  is  approximated  by  a 
eosine  on  a  pedestal  given  by 

W(e)  =  W(0°)  +  W(90")  •  cos(0)  (5) 

where  W  represents  the  null-tomull  width  of  the  sine  main  lobe.  This  is  also  the  variable  width  of  the 
sliding  window  that  is  used  for  filtering.  Now  that  the  variable  window  width  has  been  defined,  the 
next  step  is  to  establish  the  operation  that  will  be  applied  to  the  data  within  the  variable  window. 

POWER  Scale  Weight:  The  first  two  filtering  techniques  perform  a  weighted  average  of 
the  data  points.  The  weighting  will  correspond  to  a  sine^,  i.e.  ((sin  x)/x)^  similar  to  scattered  power 
raw  data  which  is  proportional  to  the  square  of  the  scattered  E-field.  Tlie  first  technique  averages  data 
in  logarithmic  units  while  the  second  technique  averages  data  in  linear  units  (sm).  The  two 

filtering  techniques  are  given  by 
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Figure  L  Raw  Generic  Long  Object  Scattered  Field  Data.  HH polarization,  18  GHz. 

where  <t  is  the  magnitude  of  the  echo  area  in  dBsm  and  n  is  the  number  of  data  points  in  the  filtering 
window  centered  about  the  i*  data  point  as  given  by  (6).  The  numerical  subscript  on  vindicates  the 
particular  filtering  technique. 

FIELD  SCALE  WEIGHTS:  The  third  and  final  technique  takes  into  account  that  the  raw  data 
is  actually  processed  scattered  E^-filed  data.  In  fact,  by  using  echo  area  values,  the  phase  information 
of  the  scattered  E-field  is  lost.  Therefore,  the  third  technique  converts  the  echo  area  value  back  into  a 
value  proportional  to  the  scattered  E-field.  As  long  as  the  phase  of  the  scattered  E-field  is  recorded 
along  with  the  magnitude  of  the  echo  area,  it  is  possible  to  recover  all  of  the  information  obtained 
during  the  measurement  or  prediction  process.  The  echo  area  (or  radar  cross  section,  RCS)  o  is  defined 

by 


a  -  lim  4kR^ 


¥ 


(9) 


where  and  are  the  scattered  and  incident  E-fields  respectively. 

It  is  necessary  to  understand  how  the  scattered  field  measurements  were  made  in  order  to  be 
able  to  extract  the  desired  information  about  the  scattered  E-field.  Since  static  scattered  field 
measurements  are  typically  made  in  the  presence  of  other  scatterers,  it  is  necessary  to  make  calibration 
measurements  of  a  scattering  object  for  which  the  scattered  field  is  known.  Typically,  four  sets  of 
measurements  are  made  to  obtain  a  single  scattered  field  signature.  First,  the  scattenng  from  the 
support  structure  for  the  calibration  object  is  measured.  Second,  the  scattering  from  the  calibration 
object  and  its  support  structure  is  measured.  Third,  if  a  different  support  structure  is  used  for  the  test 
object,  the  scattering  from  it  is  measured.  Finally,  the  scattering  from  the  test  object  and  its  support 
structure  are  measured.  With  these  four  measurements  and  the  known  scattered  field  of  the  calibration 
object,  it  is  possible  to  calculate  the  echo  area  of  the  test  object.  This  calculation  is  given  by 
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Figure  2.  Processed  Scattered  Field  Data  by  Filtering  in  Logarithmic  Units. 
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The  subscripts  refer  to  the  test  (t),  test  background  (th),  calibration  (c),  and  calibration  background 
(cb).  Each  of  the  E~fields  are  complex  quantities.  The  phase  recorded  with  the  magnitude  of  the 

scattered  field  is  that  of  the  numerator  in  (10).  Therefore,  a  complex  quantity,  X,  proportional  to  the 
scattered  E-field  of  the  test  object  is  given  by 


|x|  =  Vio^** 


(11) 


and 


X  =  |x|cos(0)  +  7|x|sin(<(>) 


(12) 


where  (j)  is  the  phase  of  the  scattered  E-field  which  was  recorded  during  the  measurement  or  prediction 
process. 

Instead  of  processing  the  magnitude  of  the  scattered  field  as  was  done  in  the  first  two  filtering 

techniques,  the  third  technique  processes  the  complex  quantity  X,  which  is  proportional  to  the 
complex  scattered  E-field.  Once  again,  the  sliding  window  size  is  given  by  (6)  and  a  weighted  mean  is 

calculated  on  the  data  values  within  each  window.  Since  the  complex  quantity  X  is  proportional  to 
the  complex  scattered  E-field  and  not  the  square  of  the  E-field,  the  weights  are  given  by  the  sine 
function  instead  of  a  squared  sinc^.  The  third  filtering  technique  is  then  given  by 
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Figure  3.  Processed  Scattered  Field  Data  by  Filtering  in  Linear  Units. 


X3, 


1  i+n 
=— 

2n  +  \,T-n 


Xk 


sin(^g-) 


(13) 


Once  the  complex  quantities  have  been  processed,  they  are  converted  back  into  standard  echo  area 
values  and  plotted  in  units  of  dBsm.  Thus  the  processed  echo  area  using  the  third  technique  is  given 
by 


CT3,  =  101ogio(|x,,,f)  (14) 

Comparison  of  RLTERING  Techniques:  Measured  data  from  a  generic  long  object 
is  used  as  the  sample  data  set.  The  measured  data  was  taken  at  18  GHz  with  angular  step  size  of  0.25“ 
using  horizontal  polarization.  Figure  1  is  a  plot  of  the  raw  scattered  field  data  plotted  in  dBsm  versus 
azimuth  angle.  As  indicated  earlier,  the  scattered  field  varies  rapidly  over  extremely  small  sweeps  in 
azimuth  angle.  Figures  2  through  4  correspond  to  the  three  filtering  techniques.  The  first  two  filtering 
techniques  produce  fairly  similar  sets  of  smoothed  data.  The  peaks  and  nulls  are  slightly  more 
pronounced  in  the  first  technique  which  performs  the  weighted  averaging  on  logarithmic  data.  An 
interesting  observation  is  that  the  main  peak  at  90“  is  almost  identical  over  a  span  of  about  70“  for  the 
first  two  filtering  techniques.  As  might  be  expected,  the  third  filtering  technique  produces  a 
significantly  different  set  of  smoothed  data.  The  coherent  adding  of  the  complex  values  is  responsible 
for  the  increased  jaggedness  and  multiple  peaks  and  nulls  at  90“  Although,  not  readily  evident  from 
the  figures,  the  values  resulting  from  technique  #3  are  all  less  than  the  values  resulting  from  the  other 
two  techniques.  These  lower  values  are  caused  by  the  summation  of  complex  values  instead  of 

magnitudes.  t-  t. 

It  is  difficult  to  make  a  general  choice  of  processing  technique  for  use  in  all  cases.  For  the 
purposes  of  generic  long  object  data,  either  one  of  the  techniques  that  average  scattered  field  values 
appears  to  work  well.  The  third  technique  does  not  produce  smooth  results  at  broadside  for  this 
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Figure  4.  Processed  Scattered  Field  Data  by  Filtering  of  complex  E-field  values  in  Linear  Units. 

particular  set  of  data.  However,  its  failure  for  this  data  set  should  not  exclude  it  from  considerations 
for  use  in  other  cases. 

Summary:  Evaluation  of  an  EM  scattering  prediction  code  requires  several  choices  and  decisions 
to  be  made.  First,  a  set  of  test  scattering  objects  must  be  chosen  which  will  create  the  various 
scattering  phenomena  of  interest.  Second,  reference  data  sets  must  be  obtained  for  each  test  object. 
These  reference  data  sets  may  come  from  analytical  solutions,  measured  data,  or  other  prediction 
codes.  If  another  prediction  code  is  used  to  create  the  reference  data,  it  must  be  generally  accepted  to 
produce  accurate  results,  or  the  evaluation  is  of  little  significance.  Finally,  a  method  of  comparing 
predicted  and  reference  data  must  be  chosen.  It  is  highly  desirable  that  a  quantitative  comparison 
method  be  chosen  to  compliment  qualitative  comparisons.  Quantitative  comparisons  provide 
unbiased,  objective  comparisons  and  accommodate  both  simple  and  complicated  scattered  field 
patterns.  If  complicated  scattered  field  patterns  are  being  compared,  data  processing  and  filtering  may 
simplify  the  comparisons. 
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1.  Introduction 

High  voltage  transmission  lines  are  the  important  and  necessary  components  of  an  electric  energy 
distribution  system.  The  electromagnetic  field  of  the  transmission  lines  may  affect  human  beings  and 
sensitive  electronic  devices  in  its  neighborhood.  The  interconnected  electromagnetic  influences 
between  the  active  electromagnetic  system  and  its  environment  which  is  called  Electromagnetic 
Compatibility  (EMC)  has  become  increasingly  important  recently  because  of  its  wide  ranging 
industrial  implications.  Another  name  "Electrosmog"  for  EMC  has  been  introduced  to  stress  the 
significance  of  electromagnetic  environment  pollution  [1]. 

Basically  there  are  two  types  of  high  voltage  transmission  lines.  The  one  is  an  overhead 
transmission  line  and  the  other  an  underground  cable.  The  EMC  problems  in  the  overhead 
transmission  line  power  system  have  been  extensively  discussed,  see  e.g.  [2].  An  obvious  negative 
biological  effect  of  the  electromagnetic  field  caused  by  the  overhead  transmission  line  power  system 
has  not  been  recognized  [2,  3]. 

The  influence  of  the  underground  power  cable  systems  on  the  environment  is  more  prominent. 
This  is  easy  to  understand.  First,  the  underground  power  cable  system  is  generally  adopted  in  the 
urban  district  of  dense  population.  Secondly,  the  distance  from  the  cables  to  the  earth  surface  is 
smaller  than  that  in  the  overhead  transmission  line  power  system.  Therefore,  it  is  more  interesting  to 
investigate  the  EMC  quality  of  the  underground  power  cable  system. 

Besides  the  possible  biological  effects  of  the  electromagnetic  field  on  human  beings,  some  other 
medical  problems  also  exist.  For  example,  questions  are  raised  with  regard  to  the  behavior  of  pace¬ 
makers  under  the  influence  of  the  electromagnetic  field  of  the  underground  power  cable  systems.  The 
new  limit  value  of  the  magnetic  flux  density  of  30  fiX  for  the  normal  operation  of  the  pace-makers 
has  been  put  forward  and  recognized  in  Germany,  However,  in  some  of  the  underground  power  cable 
systems  (400  kV)  the  magnetic  field  in  the  region  above  the  ground  may  exceed  this  limited  value. 
Thus,  the  shielding  of  the  magnetic  field  produced  by  such  an  underground  cable  system  is  necessary. 

The  shielding  of  the  magnetic  field  has  been  successfully  achieved  by  using  a  thin  ferromagnetic 
plate.  This  is  a  typical  classic  open  magnetic  shielding  problem.  In  order  to  effectively  design  the 
magnetic  shielding  for  the  underground  cable  system,  it  is  essential  to  compute  the  magnetic  fields  of 
the  power  cable  systems  in  the  existence  of  the  shielding  plates  or  other  ferromagnetic  materials  such 
as  steel  girder.  For  this  purpose,  a  software  (HYBEMC  1.0)  based  on  the  Hybrid  Finite  Element  (FE) 
and  Boundary  Element  (BE)  method  has  been  developed  [4].  In  this  Hybrid  FE-BE  solution  strategy, 
the  nonlinearity  and  inhomogeneity  of  the  ferromagnetic  materials  are  treated  by  the  FE  method  while 
the  infinite  exterior  space  is  considered  by  the  BE  method.  To  improve  the  solution  accuracy  and  the 
user-pleasantness,  an  adaptive  mesh  generation  is  also  implemented. 

In  this  paper,  the  Hybrid  FE-BE  method  and  the  features  of  the  developed  software  are  to  be 
reviewed.  Then,  some  application  examples  will  be  represented.  Consequently,  one  example  of  the 
design  of  the  magnetic  shielding  system  by  means  of  the  numerical  simulation  will  be  given  to  show 
the  approach  to  design  the  magnetic  shielding  in  the  underground  power  cable  systems. 


*  This  project  was  funded  by  Siemens  AG,  Germany 
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2.  Hybrid  FE-BE  Method 


In  the  2-D  electromagnetic  field  problems  in  which 
the  exciting  current  has  only  one  component,  say  z- 
component,  see  Fig.  1,  the  magnetic  vector  potential 
has  also  one  component  and  automatically  satisfies 
the  Coulomb  Gauge. 

In  terms  of  the  magnetic  vector  potential,  the 
electromagnetic  field  problems  at  power  frequency 
can  be  formulated  by: 


1.  V  — 
5;c  dXj 


Sj’l  dyj 


in 


Figure  1  Model  configuration  of  2-D 
electromagnetic  fields 


and 


^  d^A 
dx^  dy^ 


in  ^be 


(2) 


where  A  is  complex  phaser  of  the  vector  potential,  v  reluctivity,  Pq  permeability  in  vacuum,  a 
conductivity,  m  angular  frequency,  Jq  and  the  exciting  current  sources  in  Qp^  and  respectively. 

2.1  FEM  for  the  region 

At  first,  the  weak  FEM  formulation  of  eqn.  (1)  can  be  written  as: 


dAdW  dAdW  .  ^  a. 

. - +  V - +  i(}ioAW  dxdy 

dx  dx  ay  ay 


£  W-^  —  dV 
dn 


-  f f  j^Wdxdy  =  0 


where  W  is  the  weight  function  and  n  is  the  outward  normal  vector  at  the  interface  F^.  We  note  here 
that  the  contour  integral  term  of  eqn.  (3)  will  only  be  considered  when  the  FE  element  shares  a 
common  boundary  with  the  region  otheiA\nse,  the  contribution  of  this  term  is  omitted. 

2.2  BEM  for  the  region 

The  BE  formulation  of  eqn.  (2)  can  be  derived  by  using  the  Weighted  Residual  Method  [7]: 


where  G*  is  the  fundamental  solution  of  eqn.  (2). 

2.3  Coupling  the  FE  and  BE  equations 
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At  the  interface  between  Qp^  and  Q^e,  there  exist: 


^BE 

II 

_l( 

1^0  i 

dn  j 

A 

By  means  of  eqn.  (5),  eqn.  (3)  and  eqn.  (4)  can  be  coupled  into  a  system  of  equations,  in  a  matrix 
form  [4]: 


C'X  =  Y 


(6) 


where  X  = 


y  =  [F  a^Y  ^  = 


V{H) 


K\  T  and  F  are  the  coefficient  matrices  of  eqn.  (3)  and  H,  G  and  the  coefficient  matrices  of  eqn. 
(4).  T(H)  means  that  the  column  of  the  coefficient  matrix  H  is  relocated  according  to  the  FE  node 
numbering.  In  implementation,  a  mixed  element  discretization  i.e.  a  linear  variation  of  the  vector 
potential  over  a  FE  element  and  a  constant  distribution  of  the  normal  derivative  over  a  boundary 
element  is  used  [5,6]. 

It  has  to  be  mentioned  that  the  system  matrix  C  is  unsymmetric  because  of  the  contribution  of 
the  BE  equation.  This  makes  the  solution  of  the  system  equations  difficult  and  partially  compensates 
the  advantage  of  the  Hybrid  FE-BE  method. 


3.  Features  of  the  Developed  Software 
3J  Treatment  of  nonlinear  eddy  current  problems 

For  the  nonlinear  eddy  current  problems,  the  physical  values  are  not  sinusoidal  when  the  exciting 
current  is  yet  sinusoidal.  In  other  words,  the  permeability  of  the  ferromagnetic  materials  is  dependent 
on  not  only  position  but  also  time.  Therefore,  the  nonlinear  eddy  currents  cannot  be  represented  in 
terms  of  the  complex  phaser  with  single  frequency  [7].  However,  the  real  permeability  can  be  replaced 
by  the  so-called  equivalent  permeability  when  the  wave  shapes  of  the  physical  variables  are  not 
interesting.  The  equivalent  permeability  is  only  a  function  of  position.  Then,  the  nonlinear  eddy 
current  problems  can  be  still  formulated  by  eqn.  (1). 

There  are  two  kinds  of  equivalent  permeabilities:  RMS  model  [8,9]  and  energy  model  [8].  In  this 
paper,  the  RMS  equivalent  permeability  is  adopted  [4]. 

For  the  nonlinear  problems,  the  resulting  system  of  equations  is  the  same  as  eqn.  (6)  but  the 
system  matrix  is  dependent  on  the  nonlinear  reluctivity,  i.e. 

C(v)  X  =  Y  0) 

To  solve  this  nonlinear  equation  system,  the  damped  Newton-Raphson  method  is  applied  [4]. 
However,  the  numerical  experiments  showed  that  the  convergence  of  iteration  is  not  ensured  in  the 
case  of  thin  ferromagnetic  plates.  In  order  to  overcome  this  difficulty,  a  new  method  to  compute  the 
reluctivity  in  the  iterative  process  is  implemented.  In  this  secant  method  [10],  the  hew  reluctivity  in 

the  n.th  iteration  is  derived  as  the  secant  where  obtained  from  the  B-H  curve 
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according  to  the  magnetic  field  strength  and  is  directly  computed  from  the 

Hybrid  FE-BE  method  solution.  In  this  way,  the  iteration  convergence  can  be  greatly  improved. 

3.2  Solution  of  the  system  of  complex  and  unsymmetric  equation  system 

As  mentioned  in  section  2.3,  the  system  matrix  C  is  complex  and  unsymmetric.  To  solve  this  kind  of 
equations,  various  iterative  solvers  have  been  numerically  investigated.  We  have  tested  three  iterative 
solvers:  BCG  (BiConjugated  Gradient  method),  GMRES  (Generalized  Minimal  Residual  method)  and 
CGS  (Conjugated  Gradient  Squared)  with  several  ILU-preconditioners  (Incomplete  LU 
decomposition).  In  the  case  of  the  test  examples  resulted  from  the  Hybrid  FE-BE  formulation  for  the 
EMC  problems  in  the  cable  systems,  the  divergence  of  the  solution  of  the  equation  systems  by  using 
BCG  and  GMRES  has  not  been  recognized.  The  convergence  rate  of  BCG  is  faster  than  that  of 
GMRES  [11]. 

However,  the  convergence  of  CGS  is  not  ensured.  Whenever  convergent,  CGS  converges  more 
rapidly  than  BCG  and  GMRES. 

Furthermore,  it  has  been  shown  that  the  increase  of  the  fill-in  level  in  the  ILU-precondition  can 
raise  the  convergence  rate.  For  more  detailed  informations  see  [11]. 

3.3  Modelling  of  the  idealized  exciting  current  sources  in  the  FEM 

In  the  EMC  problems  of  the  power  systems,  the  electromagnetic  field  distribution  in  the  area  of  the 
exciting  current  sources  is  generally  not  required  to  be  exactly  computed.  Therefore,  the  exciting 
current  sources  can  be  simplified  into  some  kinds  of  ideal  current  sources,  e.g,  the  ideal  line  current 
and  the  ideal  current  sheet,  in  order  to  reduce  the  dimension  of  the  solution  problem. 

However,  the  surface  current  density  of  the  idealized  exciting  current  sources  is  infinite  because 
their  cross  area  is  zero.  One  approach  to  model  the  idealized  exciting  current  sources  in  the  FEM  is 
to  use  the  concept  of  the  Dirac  Delta  function  [12].  For  example,  the  ideal  line  current  source  can  be 
expressed  by: 


where  is  the  given  ideal  line  current  (Ampere),  6^  the  Dirac  Delta  function  and  P^(Xp  y^)  the 
coordinate  of  the  position  of  the  ideal  line  current. 

The  ideal  current  sheet  density  can  be  mathematically  represented  in  terms  of  the  contour 
integral  of  the  ideal  line  current  source,  i.e.: 


where  is  the  contour  on  which  the  ideal  current  sheet  source  j^(A/m)  is  distributed. 

Numerical  examples  can  be  found  in  [12],  which  shows  the  high  accuracy  of  this  approach. 

3.4  Adaptive  mesh  generation 

The  solution  accuracy  is  greatly  dependent  on  the  domain  discretization.  The  adaptive  mesh 
generation  is  such  a  strategy  that  the  mesh,  beginning  from  any  prescribed  starting  one,  is 
automatically  and  adaptively  refined  by  means  of  a  Posterior  error  estimator.  The  adaptive  mesh 
generator  used  in  this  software  [13]  is  based  on  the  error  estimator  of  the  Bank  &  Weiser  method 
[14]. 
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3.5  Calculation  of  flux  density  with  superconvergence  method  in  post-processing 

It  has  been  proven  that  the  global  convergence  rate  of  the  vector  potential  is  of  0(h^  for  the  first 
order  triangular  elements  when  h  denotes  the  mesh  size.  However,  the  first  order  derivatives  such  as 
flux  density  show  the  convergence  of  0(h)  [15].  In  the  simulation  of  the  EMC  problems  of  the  cable 
systems,  the  mesh  is  strongly  concentrated  near  the  plate  because  the  variation  of  the  magnetic  field 
in  the  ferromagnetic  plate  and  its  neighborhood  is  much  greater  than  that  in  the  space  above  the  earth 
surface,  in  which  the  flux  density  is  interested.  Thus,  the  accuracy  of  the  flux  density  in  the  space 
above  the  earth  surface  is  not  sufficient. 

In  the  literature  [15],  a  post  processing 
operator  of  superconvergence  property  was 
proposed.  The  idea  of  this  post  processing 
operator  is  to  use  an  analytical  solution  to 
calculate  the  first  order  derivatives.  This 
analytical  solution  is  the  solution  of  the 
Laplace  equation  tsu  =  0  in  a  small  circle 
surrounding  the  point  at  which  the  flux 
density  is  evaluated,  together  with  the 
boundary  condition  on  the  circumference, 
on  which  the  vector  potentials  are  know 
from  the  numerical  FEM  calculation. 

One  example  is  shown  in  Fig.  2,  in 
which  the  distribution  of  the  flux  density 
directly  evaluated  from  the  Hybrid  FE-BE 
computation  is  compared  with  that  obtained 
by  using  the  above  post  processing  operator 
of  superconvergence. 

4.  Application  Examples 
4.1  Shielding  of  a  single  phase  cable  system  with  an  iron  plate 

Fig.  3  shows  the  equipotential  map  of  an  experimental  model  in  which  the  magnetic  field  excited  by 
a  single  phase  cable  system  is  screened  by  a  thin  iron  plate. 


Figure  2  Comparison  of  the  induction  amplitude 
(Example  model:  Fig.  3,  y  =  1  m) 


Figure  3  Equipotential  map  (cot=0) 

Shielding  effect  of  a  iron  plate  on  the  magnetic  field  excited  by  a  single  phase  cable  system 
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42  Power  cable  systems  near  steel  materials 


Near  the  power  cable  systems  there  often  exist  the  ferromagnetic  materials,  for  example,  the  power 
cables  are  located  on  one  side  of  a  bridge  over  a  river.  As  an  example,  the  magnetic  field  of  a  three- 
phase  power  cable  system  near  a  steel  girder  is  shown  in  Fig.  4. 


Figure  4  Equipotential  map  (cat=0) 

A  three-phase  power  cables  near  a  steel  girder 

43  Shielding  effect  of  double  iron  plates 

Two  three-phase  systems  are  usually  adopted  in  the  underground  power  cable  systems.  In  these 
systems  the  cables  are  embedded  underground  e.g.  more  than  2  m.  Thus,  a  double  plates  shielding 
system  can  be  applied  to  improve  the  screen  effect.  One  example  is  shown  in  Fig.  5,  in  which  both 
edges  of  the  plates  are  crooked  to  reduce  the  edge  effect,  i.e.  the  magnetic  field  near  the  top  of  the 
plate  edge  is  increased  because  of  the  eddy  currents. 
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5.  Design  of  Shielding  Systems  by  Means  of  Numerical  Simulations 

The  shielding  system  can  be  designed  by  means  of  the  numerical  simulation.  For  the  cable  system  of 
a  two  three-phase  cable  system,  see  Fig.  6,  the  dependency  of  the  maximal  values  of  the  screen  effect 
B, 

{SE  =  20 log dB,  where  Bj  and  are  the  induction  amplitude  with  and  without  shielding  plate 
respectively),  the  induction  amplitude  and  the  eddy  current  loss  on  the  distance  h  is  shown  in  Fig.  7. 


Figure  6  Configuration  of  the  cable  system  (I  =  1602  A,  unit:  m) 


Distance  h  (m) 

Figure  7  Dependency  of  screen  effect,  induction  and  eddy  current  loss  on  distance  h 
(x  =  -2.5  •••  2.5) 


It  can  be  seen  that  the  maximal  induction  amplitude  at  y  =  0.0031  m  can  exceed  the  limit  value 
of  30  fiT  when  the  plate  is  placed  on  the  ground  surface. 

As  a  fact,  the  screen  effect  is  strongly  dependent  on  the  plate  width  and  thickness.  The  plate^ 
thickness  should  be  3  times  the  skin  depth  [4].  Furthermore,  it  is  important  to  note  here  that  a 
practical  magnetic  shielding  system  is  also  related  with  a  lot  of  other  factors,  e.g.  economical  and 
construction  consideration. 
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6.  Conclusions 

A  strategy  to  analyze  and  design  the  shielding  systems  of  the  underground  power  cable  systems  by 
using  the  Hybrid  FE-BE  method  is  introduced.  Some  of  the  features  to  treat  the  nonlinear  eddy 
current  problems  are  reviewed.  The  examples  have  shown  the  effectiveness  of  the  developed  method. 


7.  References 

[1]  E.  David  and  M.  Palic,  Elektrosmog-Wirkungen  elektromagnetischer  Felder  auf  den  Menschen,  ETZ, 
No.3,  1994,  134-139. 

[2]  R.  Belmans  and  W.  Geysen,  Environmental  implications  of  high  voltage  overhead  lines,  Proc.  of  the  IS.th 
Congress  of  World  Energy  Council,  Madrid,  Sept.  20-25,  1992,  71-87. 

[3]  H.-J.  Hanbrich,  Der  Mensch  im  elektischen  und  magnetischen  Niederfrequenzfeld,  ETZ,  No.3, 1994, 128- 
133. 

[4]  J.  Shen  and  A.  Kost,  Hybrid  FE-BE  method  for  EMC  problems  in  power  cable  systems,  to  be  presented 
at  COMPUMAG’95,  July,  1995. 

[5]  S.  J.  Salon  and  J.  M.  Schneider,  A  Hybrid  Finite  Element-Boundary  Integral  Formulation  of  the  eddy 
current  problem,  IEEE  Trans,  on  Magn.,  Vol.  18,  1982,  461-466. 

[6]  T.  Onuki,  Hybrid  Finite  Element  and  Boundary  Element  method  applied  to  electromagnetic  problems, 
IEEE  Trans,  on  Magn.,  Vol.  26,  1990,  582-587. 

[7]  J.  Shen,  Computation  of  linear  and  nonlinear  eddy  currents  with  the  Boundary  Element  method,  Ph.  D. 
thesis,  Berlin,  University  of  Technology,  Germany,  1994. 

[8]  J.  D.  Lavers,  Finite  Element  solution  of  nonlinear  two  dimensional  TE-mode  eddy  current  problems,  IEEE 
Trans,  on  Magn.,  Vol.  19,  1983,  2201-2203. 

[9]  Y.  D.  Terrail,  J.-C.  Sabonnadiere,  P.  Masse  and  J.  L.  Coulomb,  Nonlinear  complex  Finite  Elements 
analysis  of  electromagnetic  field  in  steady-state  AC  devices,  IEEE  Trans,  on  Magn.,  Vol.  20,  1984,  549- 
552. 

[10]  0.  Biro,  K.  Preis  and  K.  R.  Richter,  Various  FEM  formulations  for  the  calculation  of  transient  3D  eddy 
currents’  in  nonlinear  media,  Proc.  of  CEFC’94,  to  be  published  in  the  May  1995  issue  of  IEEE  Trans,  on 
Magn.. 

[11]  J.  Shen,  T.  Hybler  and  A.  Kost,  Comparative  study  of  iterative  solvers  for  the  complex  and  unsymmetric 
system  of  equations,  to  be  presented  at  COMPUMAG’95,  July,  1995. 

[12]  J.  Shen  and  A.  Kost,  Modelling  of  the  idealized  current  sources  in  the  FEM,  Proc.  of  CEFC’94,  to  be 
published  in  the  May  1995  issue  of  IEEE  Trans,  on  Magn., 

[13]  L.  Janicke,  Finite  Elemente  Methode  mit  adaptiver  Netzgenerierung  fur  die  Berechnung  dreidimensionaler 
elektromagnetischer  Felder,  Ph.  D.  thesis,  Berlin  University  of  Technology,  Germany,  1994. 

[14]  R,  E.  Bank  and  A.  Weiser,  Some  a  Posterior  error  estimators  for  elliptic  partial  differential  equations. 
Mathematics  of  Computation,  Vol.  44,  1985,  283-301. 

[15]  M.  Kasper  and  J.  Franz,  Highly  accurate  computation  of  field  quantities  and  forces  by  superconvergence 
in  Finite  Elements,  Proc.  of  CEFC’94,  to  be  published  in  the  May  1995  issue  of  IEEE  Trans,  on  Magn.. 


65 


NEW  CONTRIBUTION  TO  THE  STUDY  OF  FAULT  CURRENTS  DISTRIBUTION  IN  THE  GROUND 

SYSTEMS 

BrodsKyn,  H.  O.;  Giarolla,  M.  H.;  Cardoso.  J.  R.;  Abe,  N.  M.;  Passaro,  A.* 

Escola  Politecnica  da  USP 

Depto.  de  Engenharia  de  Energia  e  Automagao  Eletrica 
Av.  Prof.  Luciano  Gualberto,  Trav.  3,  158 
05508-900  -  Sao  Paulo  -  SP 

*  Instituto  de  Estudos  Avangados  -  lEAv/CTA 


SUMMARY 

In  previous  work  [1 ,2]  the  author  presents  a  methodology,  based  on  finite  elements  method  (FEM),  to 
calculate  the  potentials  distribution  on  the  grounding  area  influence.  At  that  time  the  input  to  the  solver  admits 
as  known  the  split  of  the  system.  The  aim  of  this  article  is  to  demonstrate  the  advantages  of  finite  element 
method  (FEM)  associated  with  power  systems  equations  with  lumped  parameters  to  intrinsically  obtain  the 
distribution  of  fault  current  in  several  offered  conductor  ways  and  the  potential  distribution  on  the  ground, 
simultaneously.  To  solve  the  complex  equations  system,  the  methodology  adopted  was  that  described  by 
Mesquita  [4].  That  solution  was  adopted  because  of  its  efficiency  and  easy  to  adapt  to  the  FEM  software 
implemented. 

1.  INTRODUCTION 

This  work  describes  a  method  to  calculate  the  distribution  of  currents  between  ground  and  over  head 
ground  wires  when  a  line  ground  fault  occurs  in  a  transmission  line  as  well  as  the  potentials  grades  that  had 
been  developed  in  the  earth. 

Most  of  works  had  developed  techniques  that  put  emphasis  on  one  of  two  points  bellow: 

-  Calculation  of  lumped  parameters  and  use  traditional  network  studies,  with  simplifications  mainly  on 

the  representation  of  the  ground  resistance  and  their  mutual; 

-  Analyses  based  on  empirical  graphics. 

This  kind  of  phenomena  just  on  the  few  last  years  started  to  be  solved  using  FEM  techniques  [1],  but 
even  in  this  case  this  solution  starts  from  the  hypotheses  of  fault  current  distribution  known. 

Our  proposal  is  to  mix  both  FEM  techniques  on  the  ground  and  lumped  parameters  of  the  over  ground 
network  as  well  the  mutual  between  line  and  ground  cables  in  the  same  solution  using  a  global  complex  matrix. 

The  computation  in  the  frequencies  domain  brings  a  new  contribution  to  the  study. 

The  application  of  the  Incomplete  Cholesky  Complex-Bi-ConJugate  Gradients  method  (ICCBCG)  was 
identified  as  the  most  appropriated,  in  the  face  of  encouraging  results  obtained  until  the  moment,  as  well  as 
existing  symmetry  in  the  resultant  matrix. 

In  the  developed  methodology,  the  domain  is  subdivided  in  two  parts: 

1.  soil  and  buried  elements; 

2.  electric  network,  as  show  on  fig.  1 .1 . 
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Fig.  1.1  -  Schematic  Model  of  the  Study  Domain 


The  matrix  that  simulates  the  system  will  be  the  composition  of  the  global  matrix  arising  from 
mathematics  formulation  for  the  FEM  to  the  ground  system,  with  the  insertion  of  the  network  lumped 
parameters,  resulting  in  a  system  of  equation  like: 

Ax=b  (Eq.  1.1) 

where  A  is  the  complex  matrix,  resultant  of  the  global  matrix  composition  of  the  FEM  with  insertion  of  lumped 
parameters  of  the  fault  current  dispersal  system,  x  is  the  scalar  potential  of  all  domain  nodes,  and  b  it  is  the 
impress  current  vector. 

The  interest  to  know  with  better  detail  the  distribution  of  current  in  the  multiconnected  earth  system  is 
that  of  minimize  investment  on  ground  grid  as  well  as  optimize  its  design.  As  by-product,  the  used 
methodology  allows  the  study  of  transference  potentials  in  metallic  elements  not  directly  linked  up  with  the 
electric  system,  but  they  are  situated  in  their  influence  area  (problem  domain)  like:  plumbing,  metallic 
structures,  armors,  with  they  have  contact  or  they  are  buried  In  the  soil,  in  the  moment  of  incident  of  the  line  to 
ground  fault. 

2.  CONSTRUCTION  OF  THE  COMPLEX  MATRIX 

Following  [5]  we  can  simplify  the  network  model,  if  there  are  presented  just  line  and  over  ground  wire 
in  a  transmission  line,  to  the  follow  scheme: 


Fig.  2.1  -  Impedance  model  of  lumped  parameters. 
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The  points  A  and  B  are  examples  of  interface  between  model  with  lumped  parameter  and  the  ground 

(FEM). 

If  we  change  in  the  over  seen  model  the  Zmg  impedance  by  current  generator: 

lg  =  ^(3lo)  (Eq.2.1) 

the  interconnection  of  two  substations  can  be  show  on  fig.  2.2.: 


Fig.  2.2  -  Interlink  of  two  ground  networks  of  substations. 

The  construction  of  the  Matrix  Is  done  Including  in  the  FEM  matrix  [G]  the  nodes  introduced  by  the 
links  with  the  lumped  parameters.  To  do  that  the  impedance  of  the  interconnections  between  the  tower  fop  and 
the  ground  electrode,  and  the  self  impedance  of  the  over  ground  wire  will  be  treated  as  one  dimension 
elements  to  be  include  in  the  main  matrix  as  show  bellow: 


Tower  interconnections  series  impedances.  Elements  of  parallel  impedance. 
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TGI  after  lumoed  elements  included. 
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The  tower  connections  with  the  over  ground  wires  will  be  considered  as  resistive  so  the  final  matrix 
will  be  symmetric  with  complex  elements. 

This  algorithm  Is  just  the  inclusion  of  a  element  in  an  already  build  Y  matrix  as  show  bellow  [6]: 


q 

Eq 


Eq  =  Ep  -  vpq 


3.  SOLUTION  METHODOLOGY 

So,  to  solve  the  problem  defined  before,  we  have  the  following  equation: 


[G] .  [V]  =  [  I  ] 


(Eq.3.1) 


where  G  is  symmetric. 

To  solve  this  equation  it  wiil  be  used  the  method  of  Bi-Complex-Conjugate  Gradient[3].  This  method, 
shown  on  the  algorithm  III  of  Jacobs  article,  is  generic  and  can  solve  the  more  simple  situations  when  them 
arise  as  the  A  matrix  real,  positive  defined  and  symmetric,  falling  in  the  CG  method.  The  Bi-Conjugated 
method  explore  the  minimization  of  the  residue  o  r,  To  =  b  -  Axo ,  where  x  is  the  system  solution,  minimizing 
the  function,  that  on  the  CG  method  is  r"'"  A"'*  r  and  on  the  Bi-CG  real  is  (  A  +  )  x,  minimizing  the 

calculus  effort  compared  to  the  form  A’^Ax  =  A’^b.  In  this  way  the  minimum  of  r  will  be  searched  in  two 
orthogonal  directions  in  the  space  defined  in  the  problem  fj  and  fj. 

"Unfortunately  on  the  Bi-CG  system  is  not  possible  to  prove  that  the  residual  norm  is  strictly 
decrease  at  each  step  of  the  algorithm  for  a  general  complex  matrix,  as  can  be  proved  for  CG.  In  special  case 
of  a  Hermetian  matrix,  it  is  possible  to  prove  the  strict  monotonicity"  [3]. 

The  Complex  Bi-Conjugate  Gradient  Method 


Ax  =  b 


(Eq.  3.2) 


Given:  A  -  a  non-singular  matrix  with  complex  elements  of  order  n; 
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b  -  the  vector  of  right-hand  sides  of  order  n; 

Xo  -  an  Initial  estimate  of  the  solution 

The  procedure  bellow  takes  into  account  the  Jacobs  [3]  on  the  Bi-CG  system  adopted  in  the 

space; 


ro  =  b-Axo 

(Eq.  3.3) 

ro  =  r*o 

(Eq.  3.4) 

Po  =  ^0 

(Eq.  3.5) 

Po 

(Eq-  3.6) 

for  i  =  0,1,2,... 

(r/,  Pi) 

(Pi,  Api) 

(Eq.  3.7) 

Xh-i  =  X/  -f  (Xj  pi 

(Eq.  3.8) 

=  T/  —  (Xf  Api 

(Eq.  3.9) 

=?i-a*A^pi 

(Eq.  3.10) 

(A^pi,  ffn) 

(Pi.  Api) 

(Eq.3.11) 

PA^i=r^i+P/P/ 

(Eq-  3.12) 

pH'1  =  ^H-1  +  P/  Pi 

(Eq.  3.13) 

continue  until  =0 

The  bi-orthogonality  relationships  satisfied  by  the  various  vectors  are: 


(Pi,  Apj)  =  0 

for  /  >  j 

(Eq.3.14) 

(7/,  0)  =  in,  Tj)  =  0 

for  /  >J 

(Eq.  3.15) 

(Pi,  A^pj)  =  0 

for  / >  j 

(Eq.  3.16) 

(fi,  Pj)  =  (n,  Py)  =  0 

for  /  >J 

(Eq-  3.17) 

Notes:  *  complex  conjugate; 

u 

transposed  of  the  vector  or  matrix  of  complex  conjugate  elements. 
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(x,y)  =  x'^.y 
(x,y)  =  (y,x) 

Special  Cases: 


(Eq.  3.18) 
(Eq.  3.19) 


-  If  A  is  Hemnetian,  that  is  =  A,  then  the  complex  Bi-CG  algorithm  becomes  the  complex  form 
of  the  basic  CG  algorithm  just  as  one  would  expect  since  Hermetian  is  the  analogue  to  symmetry. 
The  analogous  result  as  for  the  basic  CG  algorithm  holds  on  the  strict  monotonic  of  the  residual  norm 
per  step; 

-  If  the  system  is  real  then  Po  =Po  e  =^o  and  we  have  the  Bi-CG  real  algorithm; 

-  If  the  system  is  real  and  the  matrix  A  is  positive  definite  and  symmetric  we  have  the  CG  algorithm. 

As  described  by  Mesquita  [4]  when  the  matrix  A  is  symmetric  and  complex,  we  can  use  the 
Incomplete  Choleski  method  before  applying  the  Bi-CCG,  making  the  ICCBCG  (Incomplete  Complex 
Bi-Conjugate  Gradients)  that  introduce  reductions  on  the  number  of  iterations  to  obtain  the  problem  solution. 

On  the  pre-conditioning  Incomplete  Cholesky  method  the  matrix  is: 

C=r.T^  (Eq.3.20) 


where  T  is  the  low  triangle  of  the  matrix. 
The  ICCBCG  method: 


ro  =  b-Axo 

(Eq.  3.21) 

* 

il 

o 

(Eq.  3.22) 

o 

II 

o 

(Eq.  3.23) 

o 

N 

o 

(Eq.  3.24) 

C"Zo=?o 

(Eq.  3.25) 

po  =  Zo,  /=0 

(Eq.  3.26) 

Cn.  Zi) 

'  (Ph  Ap/) 

(Eq.  3.27) 

-^H-1  ~  Xi  +  (Xi  pi 

(Eq.  3.28) 

rH-1  =  T/  —  (XiApi 

(Eq.  3.29) 

^H-l  “  (^/)  A^ Pi 

(Eq.  3.30) 

I|/'h-iII<£  ^  stop 

II 

(Eq.  3.31) 

(7h-i  1  -^^1 ) 

P'  "  Cn.Zi) 

(Eq.  3.32) 
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p^i  =  Z^i  +  Pi  Pi 

(Eq.  3.33) 

C^Zi^x  =  rM 

(Eq.  3.34) 

pH-1  =  Z/f1  4*  (p/)  Pi 

(Eq.  3.35) 

/  =  /  .f.  “I  — ^  go  to  Eq.  3.27 

4.  EXAMPLE  AND  RESULTS 

On  the  first  approach  we  will  utilize  the  simplified  example  of  figure  4.1. a,  with  DC  current  of  1500  A 
on  point  2. 


The  solver  had  considered  the  elements  between  the  points  2-13,  13-14,  e  14-5  as  lumped  values, 
and  the  rest  of  domain  had  been  simulated  with  FEM  and  ICCG  (Incomplete  Cholesky  Conjugate  Gradients) 
solution. 


0.3 


0,03S^| 
1500  A I 


:  0,03  Q 


1 


0,08334  n 


-AW - 

0,08334  £2  9 


Fig.  4.1. b  -  Equivalent  Model 


The  results  are  those  below: 


V,  =  226.4793  V 

Vj  =  226.5250  V 

V3  =  226.4539  V 

V,  =  226.4691  V 

V,  =  124.9921  V 


Ve  =  124.9714  V 

V,  =  124.9691  V 

V3  =  124.9757  V 

V,  =0 

=0 


V,  =0 

V,,  -0 

V,3  =218.0702  V 

V,,  =  133.4464  V 


For  verification  lets  take  a  equivalent  system  where  the  FEM  domain  will  be  simulated  by  lumped 
parameters  too. 

Considerations: 

-  Distance  between  two  points:  1m 

-  Soil  Resistivity:  =  1/12  £2. AW 

-  Resistances:  element  between  points  2-13:  0,03 

element  between  points  13-14:  0,3^2 


72 


element  between  points  14-5:  0.03  Q 

Then, 

Rsoio  =  P7  =  1/12  *  1  =  0,08334  O 

The  current  is  uniformly  distributed  in  the  1-2-3-4  area. 

So,  we  have  the  solution  of  the  equivalent  model  of  figure  4.1  .b  : 
V,  =  226.5  V  Vs  =  125  V  =  0 

To  the  other  result  we  will  use  the  example  of  the  fig.  4.2: 


The  ground  rods  of  the  mesh  and  of  the  towers  are  6.0  m  long,  the  mesh  is  buried  at  0  m  level,  and 
the  soil  is  stratified  in  three  layers  following: 

Om  -3m:100a.m  -3m  -9m:  400  Q.m  -9m  -15m:  1300  Q.m 

Fig.  4.3,  shows  the  equipotential  lines  distribution  due  to  a  total  ground  fault  (1=  40  A)  imposed  in  the 
left-side  mesh. 
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Fig.  4.4.a  shows  the  potential  profile  in  the  direction  (A-A’)  as  show  in  fig.4.2,  and  the  fig.  4.4. b  shows 
the  potential  surface  at  0  m  level. 


Fig.  4.4.a  -  Potential  profile 
5,  CONCLUSION 


Fig.4.4.b  -  Potential  Surface 


Although  we  had  used  just  real  domain  and  iCCG  solution  with  no  very  sofisticated  discretization  of 
the  soil  domain,  until  now,  the  results  show  viability  of  the  solution  global  matrix  with  lumped  network 
parameters  and  FEM  matrix,  as  can  be  seen  in  the  examples  ^ 
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ON  THE  OSCILLATORY  PHENOMENA  OF  EDDY  CURRENTS 
ALONG  THE  A,  V-^P  INTERFACE 


Z.  Cheng,  Q.  Hu,  S.  Gao,  Z.  Liu,  M.  Wu,  C.  Ye 
Baoding  Transformer  Works,  Baoding  071056,  China 

ABSTRACT  This  paper  investigates  the  oscillatory  phenomena  of  the  solution  of  eddy 
currents  along  the  interface  between  the  A,  V  conductor  and  the  T-current-free  r^on 
in  the  A,  V-T  approach,  and  the  schemes  to  remove  those  strange  numerical  behaviors 
have  been  proposed  and  carried  out  . 


1.  OSCILLATORY  PHENOMENA  OF  J  IN  A,  V-T  INTERFACE 

The  general  multi-subr^on  eddy  current  problems  are  often  encountered  in  the  design 
of  electromagnetic  devices,  various  advanced  analysis  facilities  have  been  developed  and 
verified  in  every  aspects.  As  a  way  out  to  cope  with  the  multi-subregion  problem 
involving  the  multiply  connected  conductors  the  combined  use  of  different  vector  and/or 
scalar  potential  sets  by  no  special  numerical  dealing  was  investigated  [1]. 

In  the  combined  version  the  T:  electric  vector  potential;  T:  magnetic 

scalar  potential)  and  the  A—V  — A  (  A:  magnetic  vector  potential;  V:  electric  scalar 
potential)  as  two  basic  potential  sets  [5]  were  alternatively  used  in  the  different 
subregions.  It  aims  to  make  use  of  the  complementary  advantages  of  every  analysis  methods 
to  be  combined  and  avoid  their  shortcomings  when  used  individually,  to  simplify  the  nume¬ 
rical  implementations,  and  to  reduce  the  computing  cost.  Since  the  numerical  solutions  of 
the  same  field  problem  by  the  different  analysis  tools  may  not  approach  to  the  exact  so¬ 
lution  in  the  same  trends  [6].  It  is  also  expected  to  improve  those  numerical  solutions  of 
both  the  eddy  current  ~J  and  the  magnetic  flux  densities  B. 

However,  the  oscillatory  phenomena  of  the  solution  of  eddy  current  7  have  been  found 
along  the  interface  between  the  conductor  described  by  the  A,  V  set  and  current-free  re¬ 
gion  in  which  T  scalar  potential  was  employed  only. 

We  have  noticed  that  0.  Biro  and  A.Kameari  have  pointed  out  the  oscillatory  phenomena 
of  the  magnetic  flux  densities  B  occurred  on  the  interface  between  A  and  T  for  magne¬ 
tostatic  case  [3, 4],  O.Biro  has  obtained  an  improved  solution  of  magnetic  flux  densities 
by  setting  up  the  A  —  T  boundary  outside  A  —conductor  [3]. 

2.  ON  THE  DUALIZED  NUMERICAL  TEST 

To  study  the  numerical  behaviours  of  the  combined  potential  sets  a  dualized  numerical 
test  has  been  proposed  [2],  that  is,  two  simply  connected  conducting  plates  of  high  per- 
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meability  were  driven  by  a  time  varying  and  uniform  magnetic  field,  and  the  X  ~  V  and 
the  T  — 'F  potential  sets  were  adopted  in  the  different  plates,  respectively,  as  shown  in 
Fig.  1. 


Hs  Hs 

Fig.  1  The  dualized  test  model 
Note:  Here  J  a  and  J  a'  are  the  corresponding  positions 


Due  to  the  symmetry  of  the  testing  conditions  described  above  the  distributions  of 
eddy  currents  induced  in  the  two  plates  should  be  the  same.  However,  the  numerical  test 
results  obtained  by  the  combined  code  have  shown  that  the  distributions  of  eddy  current 
in  the  X,  V  conductor  become  strange,  especially,  along  the  boundary  of  the  X,  V  condu¬ 
ctor,  as  shown  in  Fig.  2(a). 


(a)  without  penalty-plus  (b)  with  penalty -plus 

factor  (  Ca=l  )  factor  (  Ca=33.  e3  ) 

Fig.  2  Distributions  of  eddy  currents  caused  in 
X,  V  and  T,  ^  conductors 
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Notes: 

1)  Number  of  brick  elem.  =  1386 

2)  Number  of  unknowns  -  3758 

3)  Number  of  nodes  =1840 

4)  The  upper  plate:  T,  ^  set 
The  lower  plate:  A,  V  set 

5)  /zr  :  relative  permeabihty  of  steel  plate 
here  /zr  =  500. 

6)  PICCG  solver  used 

3.  ON  THE  PENALTY-PLUS  FACTOR 

In  the  combined  use  of  the  A,  V  —  A  — 'P  and  the  T,  'P— ^  presented  here  the  Coulomb 
gauge  was  imposed  in  forms  of  penalty-function  term  to  the  curl-curl  equations, 

VX— vxT-  {v(— V  •  T)}  + 

cr  a 


1  Ca 

V  X— V  X  A-  {V(—  V  •  A  )} 
/z  Ai 


— /z  (T-V^)  -0  (1) 

9  t 

3  A  9  V 


Notice  that  here  Ca  was  introduced  into  the  penalty-function  term  within  (}  of  (l).we 
have  seen  that  Ca  does  enhance  the  "zero-divergence"  condition,  therefore  it  can  be  re¬ 
ferred  as  a  "penalty-plus  factor".  The  distributions  of  eddy  currents  can  be  dramatically 
improved  by  tacking  a  suitable  penalty-plus  factor,  see  Fig.  2(b). 

What  are  the  reasons  to  cause  the  strange  numerical  behaviours?  It  is  worth  to 
further  study.  According  to  the  numerical  tests  done  by  the  authors  it  can  be  summarized 
as  foUows: 

(1)  The  finite  element  system  of  the  A,  V-'P  is  symmetric  but  non-positive  definite 
and  badly  ill-conditioned,  and  the  ill-condition  of  the  system  will  be  sharpening  when  the 
permeability  of  the  conductor  become  high; 

(2)  The  "zero-divergence"  condition  of  the  Coulomb  gauge  is  poorly  realized; 

(3)  Both  the  continuity  of  Bn  and  Ht  (normal  component  of  magnetic  flux  densities  S 
and  tangent  component  of  magnetic  field  intensity  H)  are  weakly  satisfied  for  nodal  FEM. 

However, it  has  been  shown  that  both  the  ill-condition  of  A-V  submatrix  and  the 
numerical  divergence  of  A  can  be  considerably  improved.  Fig.  3  and  table  1  show  the  va¬ 
riation  of  the  sum  total  of  the  non-zero  numerical  divergence  of  A  at  the  centers  of  aU 
brick  elements  with  the  penalty-plus  factor  Ca. 
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Table  2 


Ca 

1 

0.58»10® 

25 

3. 30»10® 

50 

5.10*10® 

100 

8.70*10® 

200 

15. 00*10® 

300 

21. 40*10® 

400 

27. 50*10® 

500 

33. 00*10® 

Fig. 4  relation  of  Ca  and  fit 

The  numerical  test  also  showed  that  the  relation  between  the  iteration 
PICCG  and  the  relative  permeability  of  the  conductor  ixt,  as  shown  in  Fig.  5. 


Fig.  5  relation  between  PICCG 
and  /zr 


number  of 
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4.  CONCLUSIONS 


The  dualized  numerical  tests  proposed  in  the  paper  have  been  carried  out  to  show  the 
strange  phenomena  of  the  solution  of  eddy  currents  when  run  the  combined  code,  and  those 
unexpected  numerical  phenomena  can  be  removed  by  taking  a  proposed  penalty-plus  factor 
in  Coulomb  gauging,  and  the  related  performances  about  the  penalty-plus  factor  Ca  have 
been  shown  in  the  paper. 
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Abstract:  A  new  MMP-code  for  static  field  com¬ 
putation  based  on  generalized  multipole  techniques 
(GMT)  has  been  implemented.  In  absence  of  time 
variations  MaxwelPs  equations  are  separated  in 
three  formally  equal  and  independent  systems  of 
equations  referred  to  as  the  electrostatic,  the  mag¬ 
netostatic  and  the  direct  current  system.  The  new 
program  handles  all  of  them. 

Different  types  of  expansion  functions  are  im¬ 
plemented:  regular  and  singular  multipole  functions 
for  2D  and  3D  as  well  as  charge  or  current  distri¬ 
butions  on  a  curved  thin  wire.  In  addition  to  the 
features  of  the  time  harmonic  version  both  scalar 
and  vector  potentials  are  available,  as  well  as  zero 
order  pole  functions  (point  charges,  inexistant  in 
the  time  harmonic  case  due  to  charge  conservation). 
Complex  field  components  are  reduced  to  real  ones 
and  thus  simplify  the  computation. 

In  electrostatics  both  the  total  charge  or  the  po¬ 
tential  of  electrodes  may  be  either  given  or  com¬ 
puted  (by  field  integration)  after  having  solved  the 
field  problem.  Additional  constraints  concerning, 
e.g.,  total  charge  can  be  applied. 

Static  solutions  may  serve  as  quasistaiic  expan¬ 
sion  functions  for  low  frequency  problems  and/or 
for  (electrically)  small  areas. 

Examples  out  of  electrostatics  (red  blood  cell 
deteteciion  using  an  Ohmmeter),  magnetostat¬ 
ics  (screening  effect  of  iron  rails)  and  quasistatics 
(foot  point  impedance  of  an  antenna)  show  the  wide 
range  of  application  of  the  new  static  MMP-code. 

1,  Introduction 

The  MMP- program  package  [1]  is  a  top  level  tool 
for  computational  electromagnetics  in  frequency  do¬ 
main  for  situations  smaller  than,  say  five  to  ten, 
wave  lengths  and  piecewice  homogeneous,  linear  and 
isotropic  materials.  It  is  possible  to  calculate  qua¬ 
sistatic  fields  by  specifying  a  very  low  frequency. 
However,  a  few  specific  and  typically  static  effects 
—  e.g.,  a  net  charge  on  an  insulated  electrode  —  are 
not  covered  with  this  approach. 


As  it  is  well  known,  MMP  expands  the  unknown 
electromagnetic  field 

(|)=(|;)4.(|;)  ... 

in  a  series  where  each  (^''j  is  a  solution  of  Maxwell’s 
equations,  i.e.,  the  electric  field  Ek  and  the  magnetic 
field  Hk  are  coupled  through  Maxwell’s  equations. 
As  a  matter  of  fact  this  coupling  becomes  very  weak 
at  low  frequencies.  MMP  fulfils  this  weak  coupling 
analytically  but  it  allows  numerical  errors  for  fulfill¬ 
ing  the  boundary  conditions.  In  almost  static  cases 
the  rigorous  respecting  of  the  very  weak  E  H 
coupling  turns  out  to  be  an  unnecessary  restriction. 
Using  two  independent  expansions  for  E  and  H  re¬ 
spectively  gives  more  freedom  to  satisfy  the  bound¬ 
ary  conditions.  Classic  MMP  has  not  only  an  up¬ 
per  frequency  limit  but  also  a  lower  one.  The  static 
MMP  program,  the  subject  of  this  paper,,  eliminates 
the  lower  frequency  limit. 

Note  that  in  principal  E  and  H  are  still 
coupled,  not  field^field  but  source+^source  (i.e., 
charge  <->current  through  the  charge  conservation 
principle).  However,  in  many  practical  applications 
the  sources  are  located  on  the  boundary  rather  than 
inside  the  field  domain.  The  classical  MMP  approach 
allows  errors  on  the  boundaries  but  forbids  errors 
in  the  domain.  Concerning  E  ^  H  coupling  this 
is  much  too  restrictive  at  low  frequencies.  The  (al¬ 
most)  inexistent  field+-^field  coupling  in  the  domain 
must  be  dropped. 

Moreover,  in  many  practical  cases  only  E  (or 
only  H  or  the  current  density  j  respectively)  are  re¬ 
quested.  The  remaining  fields,  though  existing,  need 
not  to  be  calculated  explicitely.  On  the  other  hand, 
potentials  of  the  fields  are  often  required  in  statics, 
in  particular  when  boundary  conditions  are  given  in 
terms  of  potentials  rather  than  in  terms  of  fields. 
This  leads  to  a  necessary  extension  of  the  MMP  pro¬ 
gram.  Of  course,  the  purely  static  procedure  allows 
one  only  to  perform  real  calculations  which  are  much 
more  efficient  than  MMP’s  complex  matrix  solvers. 
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2.  The  Static  Theory 

In  the  case  of  piecewise  homogenous,  linear  and 
isotropic  materials  (described  by  the  permittivity  £, 
the  permeability  /x  and  the  conductivity  cr,  the  static 
Maxwell’s  equations  in  the  field  domain  are 

rot  H  =  jo,  div  H  =  0,  (magnetostatics)(2~l} 
TotE  =  0,  divE  =  Qo/£^  (electrostatics)  (2-2) 
rot  j  =  0,  div  j  =  0.  (direct  current)  (2-3) 

The  only  formal  difference  between  the  three  static 
domains  are  the  right  hand  sides  in  the  equations. 
The  homogeneous  equations  (i.e.,  if  jo  =  0  and  qq  = 
0)  are  identical  for  both  H,  E  and  j.  The  main 
task  in  MMP  is  finding  an  expansion  for  the  solution 
of  the  homogeneous  equations.  Thus  we  can  treat  all 
static  cases  (electrostatics,  magnetostatics  and  direct 
current)  in  the  same  way.  We  write  F  for  the  field, 
look  for  a  solution  of 

rot  F  =  0 ,  div  jP  =  0  (3) 

and  may  state  that  F  has  both  a  scalar  potential  g 
and  a  vector  potential  G  with-^ 

F  =  rotG,  — gradp.  (4) 

Assuming  Coulomb  gauge  (div  G  =  0)  for  the  vector 
potential  all  these  functions  are  also  solutions  of  the 
Laplace  equations 

AF  =  0,  AG=0,  Aff  =  0.  (5) 

Thus,  a  general  static  MMP  expansion  /(r )  consists 
not  only  of  a  field  but  also  of  the  associated  poten¬ 
tials: 

{F{f)  vectorial  field  {E,  H ,  j. . .) 

G(f )  associated  vector  potential  (6) 
g{r)  associated  scalar  potential 

and  the  expansion  for  a  field  domain  D  is 

N 

/tot  =  /o  +  ^  dkfk ,  (7) 

jfc=l 

where  /o  covers  the  particular  (inhomogeneous)  so¬ 
lution^  while  the  ‘seven  dimensional’  real  spacefunc- 
tions  fk  satisfy  the  homogeneous  equations  (3)  to 
(6).  Note  that  optionally,  not  all  seven  components 
of  /  are  in  use  for  a  particular  problem.  E.g.,  in 

^  The  existence  of  both  (5  and  g  is  given  in  simply  connected 
field  domains  only. 

^  fo  need  not  to  have  both  scalar  and  vector  potential. 


magnetostatics  the  scalar  potential  is  not  defined  in 
the  presence  of  currents. 

The  boundary  conditions  to  be  fulfilled  on  the 
boundary  dDa  between  the  subdomains  D,-  and  Djt 
are  different  in  all  three  static  domains.  In  electro¬ 
statics,  we  have 

^tX  -  ^JbT  =  0,  SiEin  -  SkEkn  -  (8) 

where  the  indices  T  and  n  denote  the  respective  com¬ 
ponents  tangential  and  normal  to  the  boundary  and 
C  is  a  possible  surface  charge  density. 

In  magnetostatics,  we  have 

HiT  -  HkT  =  0,  —  flkHkn  =  0,  (9) 

and  in  the  case  of  direct  current,  it  is 

—jiT  -  —  jfcx  =  0,  jin  -  jjfcn  =  0.  (10) 

(^i  CTk 

AH  potential  components  are  continuous  across  a 
boundary,  as  long  as  the  same  gauge  and  the  same 
normalization  is  used  on  both  sides.  We  simply  have 

Gi-Gk  and  gi  =  gk  (11) 

on  the  boundary  5D,fc. 

3.  The  MMP  Implementation 

Similar  to  the  dynamic  case,  the  static  MMP  ap¬ 
proach  starts  with  an  expansion  (7)  in  each  sub- 
domain  D,-  and  sets  up  an  overdetermined  system 
of  equations  for  the  unknowns  a*  by  introducing 
the  expansion  in  the  respective  boundary  conditions 
(8)... (11)  being  evaluated  in  the  so-called  match¬ 
ing  points  on  the  boundary.  The  resulting  system  of 
equations  is  solved  in  the  least  squares  sense  using 
Givens  plane  rotations. 

3.1  The  Static  Boundary  Conditions 

A  straightforward  least  squares  technique  minimizes 
the  sum  of  the  squares  of  the  errors  of  each  equation. 
In  the  computer,  this  minimization  is  done  with  num¬ 
bers  rather  than  with  physical  quantities.  Due  to  the 
fact  that  fields  and  potentials  have  different  physical 
units,  numerical  errors  of  both  quantities  may  not  be 
compared  directly  with  each  other.  Further  on,  con¬ 
sidering  the  normal  conditions  (8)  in  the  special  case 
of  a  fictitious  boundary,  say  vacuum  at  both  sides, 
one  finds  that  the  numbers  in  the  right  equation  in 
(8)  are  essentially  sq  8.85410"^^  times  smaller 
than  those  in  the  left  equations  in  (8).  This  means 
that  the  mismatching  AEn  :=  Ein  -  Ekn  may  be  ten 
billion  times  larger  than  the  mismatching  of  the  tan¬ 
gential  E-components,  but  the  least  squares  routine 
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still  ‘sees’  a  larger  error  of  the  tangential  components. 
To  avoid  such  differences,  MMP  divides  the  bound¬ 
ary  conditions  (8)... (11)  by  appropriate  numbers. 
The  user  may  define  two  general  weighting  constants 
(K^s.v)  for  this  purpose,  one  for  the  scalar  potential 
and  one  for  the  vector  potential.  The  actual  equa¬ 
tions  to  be  fulfilled  in  a  matching  point  are-^ 


>^iFi 

•Ul 

-  XkFk 

Ul 

=  0, 

(12-1) 

XiFi 

•  U2 

-XkFk 

U2 

=  0, 

(12-2) 

KiFi 

'  Un 

—  KjtFfc 

Un 

=  0, 

(12-3) 

KyGi 

Ul  - 

-KyGk 

Wl 

=  0, 

(12-4) 

KyGi  • 

U2  - 

-KyGk- 

U2 

=  0, 

(12-5) 

KyGi  • 

Un  “ 

-  KyGk  • 

Un 

=  0, 

(12-6) 

Ks9i  -  K 

s9k 

=  0. 

(12-7) 

Thereby,  A,-,*  and  Ki^k  are  related  to  the  permittivi¬ 
ties  permeabilities  fii^k  and  conductivities 
ui  and  U2  are  tangential  unit  vectors  and  Un  is  the 
normal  unit  vector.  The  actual  values  for  the  con¬ 
stants  are: 


el.-stat. 

mag.-stat. 

current 

Ai  = 

1 

1 

Ajt  = 

1 

1 

Ki  = 

^/i 

1 

Kk  = 

fH 

Y 

1 

These  values  may  be  overridden  by  the  user  in  the 
same  way  as  it  may  be  done  in  the  dynamic  MMP 
program  [1,3]. 

Note  that  not  all  boundary  conditions  need  to  be 
used  simultaneously.  However,  MMP’s  least  squares 
technique  makes  it  reasonable  to  use  all  of  them  in 
so  far  as  the  related  quantities  occur  in  the  actual 
problem. 

3.2  Available  Expansion  Functions 

As  expansion  functions,  2D-  and  3D-multipoles,  2D- 
and  SD-regular  expansions  as  well  as  line  charge  seg¬ 
ments  and  line  current  segments  are  currently  im¬ 
plemented.  Note  that  different  coordinate  origins 
deliver  new  functions.  MMP  uses  several  origins  si¬ 
multaneously  in  typical  applications.  The  formulae 
for  the  2D-multipoles  of  m-th  order  are  (cylindrical 


^  The  surface  charge  density  c  is  supposed  to  be  zero. 


coordinates  p,  0  and  z): 


Fp 


^  cos 
pm+i  sin 


(m^), 

-cos'' 


F, 

Gp 


=  0, 

=  0. 


C?0  =  O, 


3 


(14) 


The  corresponding  regular  expansions  are  very  sim¬ 
ilar,  but  with  positive  powers  of  p.  [3] 

The  formulae  for  the  3D-multipoles  of  n-th  order 
and  m-th  degree  are  expressed  in  spherical  coordi¬ 
nates  r,  0  and  </>: 


F>  =  (("  + 

n  cos  0p;p  {cos  6)^ 

Gr  =  0,  (15) 

Ge  =  ^P-{oose)_Z(m^), 

G*  =  (("  + 

n  cos  OP^  {cos  9)^ 


Thereby  is  the  associated  Legendre  polynomial 
and  PII^{cos6)  :=  P^(cos  ^)/sin0  is  used  to  avoid 
numerical  instabilities  close  to  the  2-axis.  Again,  the 
corresponding  regular  expansions  are  very  similar, 
but  with  positive  powers  of  r. 

The  electric  field  of  a  straight  homogeneously 
charged  line  segment  has  the  following  form.  The 
segment  lies  on  the  z-axis  (between  z  =  0  and  z  —  1) 
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and  has  the  length  i 


3.3  Further  Features 


r  —  z 


For  simplicity's  sake,  the  cylindrical  coordinate 
p  =  -\-y^  and  the  spherical  coordinate  v  = 

^  y2  _j-  ^2  jiave  been  used.  Note  that  G  is  singu¬ 
lar  on  the  whole  ^-axis.  This  means  that  the  vector 
potential  of  this  function  should  not  be  used  if  (a 
part  of)  the  z-axis  is  in  the  field  domain. 

The  field  of  a  constant  current  in  the  same  seg¬ 
ment  is  more  sophisticated.  The  magnetic  field  H 
and  the  associated  vector  potential  A  (with  rot  A  = 
pH)  may  be  formally  evaluated  using  Biot-Savart’s 
integral: 


However,  this  is  a  highly  nonphysical  field:  The  curl 
equation  (3)  is  not  satisfied.  Nevertheless  we  can 
use  it,  but  only  in  conjunction  with  many  segments 
forming  a  closed  loop  all  together.  The  scalar  poten¬ 
tial  of  the  field  of  a  single  current  segment  does  not 
exist. 

In  static  MMP  long  curved  lines  built  up  by 
an  arbitrary  number  of  segments  may  be  defined. 
Thereby,  the  line  specification  is  the  same  as  that 
for  the  curved  line  multipoles  [2,3]. 

Beside  these  basic  expansion  functions,  there  are 
the  ‘connections’  —  essentially  expansion  functions 
built  up  by  a  (possibly  large)  number  of  other  expan¬ 
sion  functions.  The  general  procedure  for  building 
connections  is  the  same  as  in  time  harmonic  MMP 

[1,3]. 


Most  of  the  special  features  of  the  classic  (time  har¬ 
monic)  MMP  code  are  also  available  in  the  static 
version.  We  mention  the  symmetry  feature  (up  to 
three  symmetry  planes  may  be  specified),  the  con¬ 
straints  (additional  equations,  often  related  to  line  or 
surface  integrals  of  field  quantities)  and  the  integrals 
(an  output  feature  to  obtain  integral  quantities  such 
as  total  field  energy  etc.).  A  specifically  static  con¬ 
straint  is  the  total  charge  of  an  electrostatic  model: 
it  may  be  specified  by  the  user. 

Both  input  and  output  may  be  performed  using 
the  mmptool  [4],  including  all  geometric  data  and 
the  automated  choice  of  the  expansion  functions  (2D- 
and  3D-multipoles  only).  However,  at  present  state 
the  mmptool  has  not  been  adapted  to  statics  yet. 
Thus,  a  few  lines  of  particular  static  input  data  must 
be  added  to  the  input  file  using  a  text  editor. 

3.4  Quasistatic  Calculations 

Static  models  are  theoretical  models.  No  practical 
application  helds  the  same  field  unchanged  for  an  in¬ 
finite  amount  of  time.  However,  very  slowly  varying 
fields  show  the  same  spacial  dependency  as  the  static 
field.  E.g.,  we  may  write 

Eex(r,t)«^stat(r)-r(t)  (18) 

where  the  indices  ‘ex’  and  ‘stat’  stand  for  exact  and 
static  respectively  and  T{t)  is  a  (slow^)  time  func¬ 
tion.  Thus,  static  MMP  solutions  may  be  used 
as  special  expansion  functions  in  classic  (time  har¬ 
monic)  MMP.  Let  us  describe  the  procedure  with  an 
example.  Suppose  the  field  in  the  feeding  area  of  an 
antenna  is  calculated  by  means  of  static  MMP  with 
the  results 

Est^t{r)  =  EQ{f)  +  ^ajfc^jb(r),  (19-1) 

fc=i 

Njj, 

^stat(r)  =  Hoin  +  (19-2) 

jfc=i 

with  real  parameters  a*,  /?*.  These  solutions  are 
taken  as  a  connection  [1,3]  in  the  expansion  of  the 
time  harmonic  antenna  problem,  i.e.,  the  time  har¬ 
monic  expansion  for  the  feeding  area  consists  of  two 
members: 


where  ai  and  a 2  are  complex  unknowns  of  the  full 
problem  and  the  expansion  of  the  outer  area  is  of 

i.e.,  the  field  domain  is  small  compared  to  the  minimtim 
wave  length  related  to  T{t). 
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the  form  (1).  Note  that  the  two  terms  in  (20)  are 
not  classical  MMP  expansion  functions  as  those  in 
(1)  because  they  satisfy  Maxwell’s  equations  only 
approximately  and  only  as  a  pair.  Nevertheless  ui 
and  02  are  independent  and  will  be  determined  by 
matching  the  feeding  area  expansion  (20)  with  the 
outer  area  expansion  of  the  antenna.  NB:  The  ratio 
01/02  is  closely  related  to  the  foot  point  impedance 
of  the  antenna. 

A  special  version  of  the  MMP  code  has  been 
worked  out  to  allow  (pairs  of)  static  solutions  as  con¬ 
nections. 

4.  Examples 

We  treat  three  typical  examples  to  illustrate  the  ca¬ 
pabilities  of  the  new  static  and  the  quasistatic  MMP 
code.  The  first  example  is  a  direct  current  problem: 
The  penetration  of  a  red  blood  cell  (RBC)  through  a 
hole  in  a  membrane  is  measured  using  an  ohmmeter. 
To  predict  the  resistance  an  accurate  calculation  of 
the  highly  inhomogeneous  current  density  is  neces¬ 
sary.  The  second  example  treats  the  magnetostatic 
problem  of  cables  mounted  close  to  one  rail  of  a  rail¬ 
road.  The  screening  effect  of  the  rail  may  cause  prob¬ 
lems  when  the  cable  is  used  for  signal  transmission 
to  the  train.  The  third  example  deals  with  the  foot 
point  impedance  of  an  antenna.  The  feeding  area 
is  calculated  by  quasistatic  MMP  and  then  matched 
with  the  time  harmonic  solution. 

4.1  Detection  of  Red  Blood  Cells 

Since  the  conductivity  of  red  blood  cells  (RBC)  is 
much  lower  than  that  of  the  fluid  around  the  pres¬ 
ence  of  RBC’s  may  alter  the  electrical  resistance  of 
specific  situations.  The  effect  may  be  used  to  detect 
RBC’s  moving  through  membrane  holes.  A  simple 
model  with  an  insulating  membrane  with  a  single 
hole  is  shown  in  fig.  1.  Between  two  parallel  infinite 
plates  there  are  an  insulating  membrane  with  a  single 
circular  hole  and  an  insulating  RBC  imbedded  in  a 
conductive  medium.  The  geometrical  shapes  of  both 
the  (insulating)  membrane  and  the  RBC  are  simpli¬ 
fied  in  order  to  use  the  symmetry  of  revolution.  The 
picture  gives  a  plot  of  the  current  density  for  a  par¬ 
ticular  location  of  the  RBC.  The  electrical  resistance 
measured  between  the  plates  varies  strongly  when 
the  RBC  moves  through  the  hole.  Figure  2  shows 
the  relative  variation  of  the  resistance  when  the  RBC 
moves  from  the  top  plate  downto  the  bottom  plate. 

The  calculation  is  split  up  in  two  steps:  The  sit¬ 
uation  without  RBC  —  yielding  a  177  unknowns 
‘connection’^  for  the  second  step  —  and  the  influ¬ 
ence  of  the  RBC.  Both  memory  requirement  and 

^  See  end  of  sect.  3.1 


cpu  times  are  very  moderate  for  this  small  problem. 
Actual  values  for  a  generic  position  (960  matching 
points  model)  are  137(+177)  unknowns,  28  sec  cpu 
time^,  0.5  MByte  for  the  data  including  the  matrix 
and  0.6  MByte  for  the  program.  Thereby,  the  accu¬ 
racy  is  «  1%  for  individual  field  values. 

For  computing  the  resistance  R=U/I,  the  total 
current  I  is  obtained  by  integrating  the  current  den¬ 
sity  over  the  hole  area  while  the  voltage  U  is  simply 
given  by  the  boundary  values  of  the  potential. 
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Figure  1:  Two  infinitely  well  conducting  parallel 
plates  are  kept  at  potentials  0  V  and  1  V  respec¬ 
tively.  Both  the  membrane  and  the  ted  blodd  cell 
(RBC)  are  practically  insulating  compared  to  the 
blood  fluid.  The  shown  current  density  is  highly  in¬ 
homogeneous,  particularly  when  the  RBC’s  position 
is  close  to  the  membrane  hole.  The  dashed  vertical 
line  is  the  axis  of  rotational  symmetry  used  in  this 
model.  Exploiting  this  symmetry  yields  a  significant 
reduction  of  both  computation  time  and  storage  re¬ 
quirement. 


4.2  Magnetostatic  Screening  of  a  Rail 

For  transmitting  informations  from  and  to  moving 
trains  two  wire  transmission  lines  with  separately 
positioned  wires  are  mounted  along  the  railway  line, 
e.g.,  one  wire  very  close  to  one  rail  and  the  second 
wire  two  meters  apart  from  the  rail.  The  particular 
shape  of  the  rail  and  the  high  relative  permittivity 
(^r  fa  200)  of  the  iron  deform  the  magnetic  field  in 
the  close  environment  of  the  wire.  Figure  3  shows 
the  strong  screening  effect  with  one  wire  mounted  in 
the  groove  of  the  rail. 

^  All  cpu  time  values  for  a  Sparc2. 
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hole  RBC  position 


Figure  2:  The  resistance  R  —  measured  between  the 
two  plates  shown  in  fig.  1  —  depends  on  the  relative 
position  of  the  RBC.  This  picture  shows  the  varia¬ 
tion  of  R/ Rq  when  the  RBC  is  moved  along  the  ro¬ 
tational  axis.  i?o  is  the  resistance  without  any  RBC. 
Note  that  the  maximum  value  (RBC  centered  in  the 
hole)  may  be  calculated  using  an  additional  reflective 
symmetry  and  thus  allowing  much  higher  accuracy  at 
the  same  amount  of  calculation.  The  curve  is  sym¬ 
metrical  with  respect  to  the  hole  position.  Therefore, 
only  one  half  is  calculated, 
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Figure  S:  The  magnetic  field  —  the  picture  shows 
the  magnetic  induction  &  =  iiR  —  of  the  current 
flowing  in  the  wire  parallel  to  a  rail  is  strongly  de¬ 
formed.  This  is  due  to  the  shape  of  the  rail  and  the 
high  magnetic  permeability  {fiz)  of  the  iron.  The  re¬ 
turn  path  for  the  current  is  a  few  meters  apart  from 
the  rail.  See  also  fig.  4  for  a  more  general  view. 

However,  as  shown  in  fig.  4,  this  is  only  a  local 
effect  and  must  be  taken  into  account  when  the  de¬ 
tector  on  the  train  is  mounted  close  to  its  wheels. 

It  is  important  to  note  that  this  calculation  is 
purely  magnetostatic.  This  means  that  no  skin  ef¬ 
fect,  eddy  current  or  other  dynamic  effects  are  taken 
into  account.  Further  on,  we  are  discussing  a  two 
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Fit7«re  A:  Though  the  magnetic  is  deformed  in  the 
close  neighborhood  of  the  rail  (see  fig.  3)  the  effect  i 
small  far  away  from  the  rail. 


dimensional  (cylindrical)  model  here. 

The  contour  of  the  rail  is  discretized  using  688 
matching  points  and  as  much  as  693  unknowns  model 
the  field.  This  results  in  338  sec  Sparc2  cpu  time  for 
the  solution  and  about  2  MBytes  memory  require¬ 
ment  for  the  matrix.  The  use  of  so  many  unknowns 
for  a  relatively  simple  problem  yields  an  almost  ‘ana¬ 
lytical’  accuracy:  The  maximum  mismatching  is  be¬ 
low  0.1%. 


4.3  Footpoint  Impedance  of  an  Antenna 

As  already  mentioned  in  section  3.4  appropriate 
static  solutions  may  also  be  used  to  model  so-called 
quasistatic  fields.  As  an  example  we  treat  the  calcu¬ 
lation  of  the  foot  point  impedance  of  a  A/2-monopole 
antenna  mounted  on  an  infinite  ground  plane.  The 
feeding  area  is  modelled  very  accurately  in  order  to 
show  the  power  and  the  precision  of  the  method. 

In  a  first  step  the  electric  field  in  the  feeding  area 
is  calculated.  Figure  5  shows  the  situation  and  the 
resulting  field.  A  coaxial  cable  (with  a  dielectric) 
feeds  the  antenna  from  the  bottom,  the  ground  plane 
has  a  hole  and  is  connected  to  the  outer  conductor 
of  the  cable.  This  calculation  step  requires  58  un¬ 
knowns,  120  matching  points  and  about  1  second 
Sparc2  cpu  time.  The  voltage  is  given  by  the  poten¬ 
tial  boundary  values. 

The  second  step  deals  with  the  magnetic  field  and 
requires  no  unknowns,  no  matching  points  and  no 
cpu  time.  This  is  due  to  the  fact  that  the  magne¬ 
tostatic  field  is  just  the  field  of  a  straight  current. 
The  return  current  which  actually  flows  in  the  outer 
conductor  of  the  coaxial  cable  and  —  radially  —  in 
the  ground  plane  does  not  contribute  to  the  field  in¬ 
side  the  feeding  area  and  need  not  to  be  modelled. 
Figure  6  shows  the  result. 

The  third  and  last  step  combines  the  two  static 
fields  and  matches  them  to  the  dynamic  field  of  the 
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Figure  5:  The  electric  field  (triangles)  and  the  asso¬ 
ciated  equip otential  lines  in  the  feeding  area  of  an 
antenna,  calculated  by  means  of  electrostatics.  Only 
the  portion  below  the  dotted  arc  will  be  used  later. 

(The  outer  field  wOl  be  a  true  dynamic,  i.e.,  time 
harmonic  field.)  Note  that  even  the  permittivity  of 
the  coaxial  feeding  cable’s  dielectric  is  taken  into  ac¬ 
count.  The  model  has  a  symmetry  of  revolution  with 
respect  to  the  dash  dotted  axis  at  the  left. 

antenna.  Below  the  bell-shaped  boundary  shown  in 
figs.  5  and  6,  the  static  fields  E&tax  and  Hst&t  each 
multiplied  by  separate  complex  constants  (oe  and 
ajj)  are  valid  while  in  the  outer  area  a  conventional 
(i.e.,  time  harmonic)  MMP  expansion  is  used.  Either 
static  fields  are  matched  with  the  dynamic  field  on 
the  surface  of  the  bell.  The  computation  leads  to  cer¬ 
tain  constants  as  and  an-  Provided  that  the  static 
fields  are  associated  with  a  voltage  of  1 V  and  1 A  re¬ 
spectively,  the  complex  footpoint  impedance  simply 
becomes  Z  —  OE/aH-  Figure  7  shows  the  fitting  of 
the  static  and  the  dynamic  field  and  fig.  8  gives  the 
instantaneous  values  of  both  the  electric  field  and  the 
Poynting  field  S  close  to  the  A/2  monopole  antenna. 

The  dynamic  problem  consists  of  only  16  un- 
kowns,  the  current  in  the  antenna  modelled  with  10 
unkowns  and  a  few  multipole  functions  to  adapt  the 
solution  to  the  feeding  area  and  the  unknown  voltage 
(numerically  equal  to  oe).  The  current  (numerically 
equal  to  Oh)  is  set  to  1.  148  matching  points  were 
used  and  the  cpu  worked  for  154  seconds. 
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Figure  6:  The  magnetic  field  in  the  feeding  area  of 
an  antenna  is  calculated  by  means  of  magnetostat¬ 
ics.  It  is  equal  to  the  well  known  field  of  a  straight 
current  and  requires  pracitcaliy  no  cpu-time.  As  in 
fig.  5  only  the  portion  below  the  dotted  arc  will  be 
used  later. 
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Figure  7:  The  statically  computed  electric  field  in 
the  inner  area  fits  seamlessly  to  the  dynamically 
computed  field  in  the  outer  area.  The  same  is  true 
for  the  magnetic  field  (not  shown  in  the  picture). 
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With  the  new  static  MMP  program  both  electro¬ 
static,  magnetostatic  and  direct  current  problems 
may  be  solved.  Further  on,  the  static  solutions  may 
be  used  as  quasistatic  expansions  in  dynamic  situa¬ 
tions.  2D  and  3D  multipoles  and  regular  expansion 
functions  as  well  as  finite  length  line  currents  and 
line  charges  are  implemented.  Both  fields  and  their 
scalar  and  vector  potentials  may  be  used.  The  ex¬ 
amples  show  the  high  flexibility  and  the  wide  range 
of  applications  of  the  new  program.  A  beta- version 
of  the  program  is  available.  It  is  planned  to  add  the 
static  program  to  the  full  MMP  package. 
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Abstract 

Previoiis  work  related  to  the  characterization  of  aluminum  reduction  pots,  in 
which  extremely  high  currents  are  used  to  process  bauxite  into  aluminum,  has 
concentrated  on  finding  the  distribution  of  currents,  magnetic  fields  and  forces. 
With  these  computed  forces  and  magnetic  fields,  it  becomes  possible  to  model 
the  resultant  swirling  flow  of  molten  aluminum.  The  equations  that  govern  the 
molten  metal  flow  are  known  as  the  Navier- Stokes  equations.  Investigation  into 
a  numerical  solution  of  the  Navier-Stokes  equations,  in  the  context  of  aluminum 
reduction  pots,  is  reported  here. 


1  Introduction 

Previous  work  related  to  the  characterization  of  aluminum  reduction  pots,  in  which 
extremely  high  currents  are  used  to  process  bauxite  ore  into  aluminum,  has  concen¬ 
trated  on  finding  the  distribution  of  currents,  magnetic  fields  and  forces.  With  these 
computed  forces  and  magnetic  fields,  it  becomes  possible  to  describe  the  swirling 
flow  of  molten  aluminum  in  the  reduction  pots.  This  is  of  interest  to  commercial 
producers  of  aluminum  because  the  induced  molten  metal  motion  represents  a  sub¬ 
stantial  inefficiency,  or  loss  of  energy,  in  the  refining  process.  The  equations  which 
govern  the  motion  of  the  molten  metal,  and  viscous  flow  in  general,  are  known  as  the 
Navier-Stokes  equations.  This  paper  reports  on  an  investigation  into  the  numerical 
solution  of  the  Navier-Stokes  equations  in  the  context  of  aluminum  reduction  pots. 

There  are  two  basic  governing  equations  for  incompressible  fluid  flow,  which  in¬ 
volve  partial  derivatives  both  with  respect  to  time  and  space  variables.  These  two 
equations  also  involve  pressure  and  three  forces  -  magnetic,  flow  resistance,  and  ac¬ 
celeration.  It  is  possible  to  consolidate  these  into  a  single  equation  to  be  solved  [1], 
but  the  order  of  differentiation  is  increased  in  the  process  so  that  numerical  solu¬ 
tion  instabilities  are  inherent;  also,  pressure  does  not  appear  explicitly  in  the  final 
equation,  which  is  apparently  linked  to  convergence  problems. 

An  alternative  approach,  pursued  here,  is  to  solve  the  governing  equations  simul¬ 
taneously,  as  a  system,  at  a  lower  level  of  consolidation.  Treating  pressure  explic¬ 
itly  and  writing  separate  equations  for  the  velocity  components,  a  system  of  three 
equations  results.  A  relaxed  iterative  finite  difference  solution  of  the  Navier-Stokes 
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equations  has  been  implemented.  A  Gauss-Siedel  iteration  method  is  utilized  for  the 
simultaneous  solution  of  the  three  equations  [2]. 

Test  case  results  are  shown,  so  that  the  feasibility  of  the  method  is  established. 
The  range  of  validity  of  this  particular  numerical  solution  formulation  has  not  yet 
been  established,  and  will  be  a  matter  for  future  research  through  more  complex  case 
studies.  The  influence  of  system  parameter  values  (and  ratios)  on  convergence  are 
discussed. 

2  Governing  Equations 

The  tw^o  applicable  relations  for  incompressible  fluid  flow  are 

5V  1 

/9^  +  VP  =  JxB  +  /iVV+-V{V-V)  (1) 

and 

dP 

—  +  V  ■  (pV)  -  0.  (2) 

Ill  the  above  equations,  p  is  the  specific  density  of  molten  aluminum  (2.3  x  10^  kg/nr^) 
and  p  is  the  viscosity  (1.2  x  vector  quantities  are  velocity  V,  current 

density  J,  and  magnetic  flux  density  B. 

As  a  first  order  model  of  the  problem  of  interest,  it  is  assumed  that  all  quantities 
are  invariant  with  respect  to  depth,  so  that  work  here  is  in  a  z  =  constant  plane  (taken 
at  z  =  0  for  convenience).  The  rectangular  geometry  of  the  aluminum  reduction  pot 
is  shown  in  Figure  1. 


I 

I 

I 


T 

2 

Figure  1.  Aluminum  reduction  pot  geometry. 

Eqs.  1  and  2  involve  pressure  P  and  the  forces  Fm  (magnetic),  Fr  (flow  resistance), 
and  Frt  (acceleration).  The  gravity  force,  F^,  is  absent  by  virtue  of  investigating  only 
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horizontal  flow.  In  Eq.  1,  the  term  |V{V  •  V)  =  0  and,  in  Eq.  2,  the  term  =  0 

for  this  case.  ,  ^ 

J  X  B 

Dividing  Eq.  1  by  /?,  noting  that  Fm  “  — - —  {Fmx  and  Fmy  are  known  from 

calculation  elsewhere),  and  writing  separate  equations  for  the  velocity  components, 
one  can  then  write  the  equations  for  steady-state  incompressible  fluid  flow  as 


du  du 

u—  +  = 


dv  dv  1  dP  _2  /  .N 

u— — Vv—  —  — — — hEmy  +  r^V  V  (4) 

dx  dy  p  ox 

du  dv  . 

where  u  =  Vx^v  =  Vy  and  the  relation  p  =  pv  has  been  applied. 

Making  the  assumption  of  small  velocities,  which  is  physically  reasonable  in  this 
case,  the  equations  to  be  solved  may  be  somewhat  simplified  to  the  useful  approxi¬ 
mations  ^ 

vV^U  «  -  i^mx  (6) 

p  OX 

vV\  «  -  Fray  (7) 

p  dy 


du  _  A 
dx~^  dy 


3  Numerical  Method 

A  Gauss-Siedel  iteration  method  for  simultaneous  solution  of  Eqs,  6-8  has  been 
selected.  A  grid  of  points  is  established  in  the  z  =  constant  plane  (where  the  reduction 

L  ^  B 

pot  spans  0  <  x  <  L  and  0  <  2/  <  B  )  with  spacings  Ax  =  and  Ay  = 
where  NX  and  NY  are  the  number  of  subdivisions  desired  in  the  x  and  y  directions, 
respectively.  Three  matrices  are  utilized,  the  elements  of  which  are  storage  locations 
for  the  current  values  of  u,  u,  and  (pressure)  P  corresponding  to  the  individual  grid 
points.  Starting  with  arbitrary  initial  values,  the  computation  scheme  methodically 
updates  the  matrix  elements  in  an  iterative  manner  until  convergence  is  achieved  or 
ITMAX  (from  the  program  input  data)  iterations  have  transpired. 

To  implement  this  method  on  the  computer,  it  is  necessary  to  express  all  deriva¬ 
tives  present  in  the  equations  to  be  programmed  by  finite- difference  operator  approx¬ 
imations. 

When  Eqs.  6  through  8  are  programmed  using  a  mixture  of  forward  difference 
and  central-difference  operators,  the  result  exhibits  very  poor  convergence  behavior. 
Although  convergence  to  2%  (maximum  relative  change  in  any  matrix  element  dur¬ 
ing  an  update  iteration)  has  been  obtained  for  two  test  cases,  neither  solution  was 
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physically  meaningful.  That  is,  convergence  to  false  solutions  occurred  in  both  cases. 
Furthermore,  a  low  Aa:  to  Ay  ratio,  on  the  order  of  0.01  or  less,  has  been  found 
empirically  to  be  necessary  for  convergence  to  occur  at  all. 

Substantial  progress  occurs  when  the  problem  is  re-programmed  to  incorporate 
only  the  central- difference  operators 

^  =  +  (9) 


and 

^  ^  ^ 

Lower-case  letters  u,  u,  and  p,  where  accompanied  by  subscripts,  indicate  elements 
of  the  respective  matrices  U,  V,  and  P.  The  fact  that  the  error  for  each  of  these 
approximations  is  on  the  order  of  the  subject  increment  (Ax  or  Ay)  squared  is 
emphasized  by  inclusion  of  the  term  O  [(‘)^]  Eqs.  9  through  11. 

The  change  to  exclusive  use  of  central-difference  operators  makes  all  error  terms 
in  the  numerical  process  the  same  order  of  magnitude,  and  results  in  a  significant 
performance  improvement.  Convergence  capability  improves  sharply,  Ax  to  Ay  ratios 
up  to  0.5  become  useful,  and  some  physically  reasonable  solutions  may  be  obtained. 
We  shall  return  to  the  matter  of  final  solutions  after  the  numerical  form  and  program 
are  discussed.  First,  the  equations  as  they  are  actually  programmed  are  developed 
below. 


Applying  =  ■ 


and  the  relation  i>  =  -  to  Eq.  6, 


L  d‘^u 

IdP 

(12) 

2  Qy2 

—  o  -C  mx  • 

p  OX 

p  d'^u 

p 

(13) 

p  dx^ 

H +  Fmx  • 

p  oy^ 

obtains 

d'^u 

d^u  ^ 

(14) 

Re-arranging, 


Multiplying  both  sides  by  p,  one  obtains 


If  the  appropriate  finite-difference  operator  approximations  are  now  used,  the  equa¬ 
tion  becomes 

- 2Ar^-^  (A^p 
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or,  multiplying  by  2 Ax,  we  obtain  the  useful  form  of 

”  ~^X  +  Ui—lj)  (1^) 

Ax 

+2/X(a^)2  (Hj+i  ~  +  2pAxFmx’ 

It  is  common  practice  to  add  a  “relaxation”  term  as  a  variable  aid  to  faster 
convergence.  The  relaxation  term  simply  contributes  a  fractional  part  of  the  old 
value  into  the  new  value  being  computed.  By  re-arranging  and  adding  such  a  term, 
the  equation  for  computer  implementation  is 

Pi^lj  ~  Pi—lJ  +  d"  Ui—Ij)  (1"^) 

Ax 

^  d“  "b  ^pAxFjnx 

{Ayy 

-|-(1  —  Rp)Pi+lJ 

where  the  asterisk  on  Pl^ij  denotes  a  new  element  value,  replacing  the  old  value 
Pi+ij-  Note  that  the  effect  of  the  relaxation  term  is  negated  if  one  inputs  “1”  as 
the  value  of  variable  Rp.  Eq.  17  obviously  can  be  re-arranged  to  give  Pi-ij  on  the 

left-hand  side,  if  desired. 

du  dv 

Now  recall  that - \^  -—  =  0  from  Eq.  8.  If  the  finite- difference  operator 

ox  ay 

approximations  are  directly  substituted: 

~~  ~  _  Q  /-I  o\ 

2(Ax)  2(Ay)  ^  ^ 

Ax 

after  multiplication  by  2(Ax).  The  re-arranged  form  of  this  equation  becomes 

=  Ui+ij  -H  ^  (vij+i  -  ■  (20) 

Adding  a  relaxation  term  and  emphasizing  the  new  element  value  with  an  asterisk 


^  +  (1  “  (21) 

The  equation  for  is  similar  and  obvious.  Note  that  each  major  equation  has 

been  assigned  its  own  relaxation  term  -  involving  Ru  here.  As  indicated  earlier,  the 
effect  of  this  relaxation  term  can  be  negated  simply  by  setting  the  variable  Ru  equal 

1  dP 

Finally,  recall  from  Eq.  7  that  vV^v  ^  “  Pmy  Again  ^ 

substitute  for  : 
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4  Test  Case 

An  example  has  been  selected  for  illustration  of  the  solution  of  the  system  of  equations 
17,  21,  and  26.  The  forcing  function  of  the  example  problem  is  depicted  in  Fig.  2. 
The  function  is  defined  along  the  mid-line  in  the  x-direction  (i.e.,  at  2:  =  |  ).  It 
varies  sinusoidally  in  y  as  shown,  with  the  periodicity  chosen  so  that  one  complete 
cycle  exactly  spans  the  ?/  =  0  to  y  =  B  range.  More  grid  points  were  used  in  the 
lower  plot  so  that  the  sinusoidal  amplitude  variation  is  more  apparent.  Convergence 
in  this  case  was  well-behaved,  with  the  solution  converging  to  an  EPSMAX  of  0.002 
in  about  55  iterations.  The  maximum  change  in  any  element  of  all  three  matrices  U, 
V,  and  P  from  one  iteration  to  the  next  must  be  less  than  user-specified  EPSMAX 
for  successful  convergence  to  be  declared. 

The  numerical  solution  results  are  given  graphically  in  Fig.  3,  A  finer  grid  was 
used  in  the  calculation  of  Fig.  3b.  In  both  cases,  the  velocity  plots  substantially 
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Figure  2a.  Coarse  grid  spacing. 
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Figure  2.  Test  case  forcing  function. 
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agree  with  what  one  would  intuitively  expect  from  the  given  forcing  function.  There  is 
a  clockwise  rotation  of  the  molten  metal,  with  the  maximum  velocity  vector  lengths  in 
agreement  with  the  maximum  amplitude  region  of  the  forcing  function.  The  solution 
goes  to  zero  along  the  pot  (boundary)  walls,  as  it  should. 

For  this  test  case,  the  relaxation  factors  are  chosen  to  be  =  1.0  while  Rp  = 
Rv  =  1.08.  The  convergence  rate  is  somewhat  sensitive  to  the  relaxation  factors 
chosen,  but  the  nature  of  the  relaxation  is  analytically  unknown  at  this  time. 

5  Conclusions 

Numerical  solutions  to  the  Navier- Stokes  equations,  in  the  context  of  aluminum  re¬ 
duction  pots,  have  been  successfully  achieved  for  rudimentary  test  cases,  as  described 
above.  Solutions  for  other,  more  complicated,  forcing  functions  are  being  investigated, 
but  with  lesser  convergence  success.  It  has  been  observed  that  the  ratio  Ax  to  Ay  has 
a  strong  influence  on  convergence  behavior.  Similarly,  a  fortuitous  choice  oi  R^^Rv, 
and  Rp  is  essential. 

Considerable  research  and  analysis  will  be  required  to  define  the  operational  limits 
of  this  particular  numerical  formulation.  Beyond  that,  the  guidelines  for  prudent 
selection  of  parameters  for  a  given  application  must  be  determined.  There  may  be 
additional  aspects  for  consideration  yet  to  be  discovered,  artifacts  arising  from  certain 
classes  of  complex  forcing  functions. 

Nonetheless,  the  methodology  presented  here  has  demonstrated  (through  the  test 
case  example)  a  capability  heretofore  not  achieved.  There  is  clearly  cause  for  opti¬ 
mism  that  the  solution  method  can  be  extended  and  successfully  generalized  from 
this  basis.  The  progression  to  more  complex  cases  may  have  to  be  in  carefully  mea¬ 
sured  steps  of  increasing  diiSculty,  but  the  critical  matter  of  feasibility  has  been 
substantially  resolved  by  this  study  to  a  positive  conclusion. 
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Abstract 

In  this  paper,  the  Singularity  Expansion  Method  (SEM)  is  used  to  describe  the  electrostatic  charge 
distribution  on  an  array  of  thin  linear  antennas  placed  in  a  uniform  electric  field.  Using  the  SEM,  the 
step  plane  wave  induced  transient  current  on  the  array  is  expanded  in  terms  of  its  singularities  (poles) 
in  the  Laplace  transform  or  complex  frequency  domain.  Transient  charge  is  derived  from  the  induced 
current  expression  using  the  continuity  equation.  The  final  value  theorem  is  applied  to  the  transient 
charge  expression  to  determine  the  electrostatic  charge  distribution  on  the  array. 


I  Introduction 

The  Singularity  Expansion  Method  allows  one  to  treat  a  conducting  body  in  a  manner  similar  to  that 
used  in  classical  circuit  theory.  In  circuit  theory,  the  response  of  a  linear  circuit  excited  by  an  arbitrary 
waveform  may  be  determined  by  knowledge  of  the  location  of  any  singularities  of  the  response  function  as 
well  as  the  corresponding  residues.  In  a  distributed  system  (e.g.,  a  conducting  body),  an  infinite  number 
of  singularities  exist  and  associated  with  each  is  a  natural  modal  current  and  charge  distribution.  For  an 
arbitrary  excitation,  one  need  only  determine  how  much  of  each  natural  mode  has  been  excited.  This  is 
determined  by  the  coupling  coefficient  associated  with  a  given  natural  mode.  [1] 

The  essence  of  the  SEM  is  its  ability  to  break  down  the  electromagnetic  interaction  process  into  mean¬ 
ingful  parameters  such  as  poles,  modes  and  coupling  coefficients.  Definitions  for  these  SEM  parameters 
will  be  presented  employing  a  method  of  moments  approximation  to  the  continuous  operator  electric 
field  equation  (EFIE)  in  terms  of  the  current  induced  on  the  array.  The  main  intent  of  this  work  is  to 
provide  a  clearer  view  of  the  fundamental  aspects  of  electrostatic  interaction  than  is  available  through 
conventional  analysis  methods. 

II  Theoretical  Development 

As  a  starting  point,  consider  the  array  comprised  of  perfectly  conducting  thin  cylindrical  elements  as 
shown  in  Figure  1.  Impinging  on  this  array  is  some  incident  field  generated  by  a  distant  source.  The 
impinging  field  induces  current  and  charge  on  the  elements,  and  these  sources  in  turn  radiate  a  scattered 
field.  Due  to  the  perfectly  conducting  nature  of  the  elements,  the  total  tangential  electric  field  on  the 
surface  of  the  conductors  must  be  zero;  that  is,  the  tangential  component  of  the  incident  electric  field 
must  exactly  cancel  the  tangential  component  of  the  scattered  electric  held.  Utilizing  standard  potential 
theory  [2],  the  scattered  electric  field  can  be  expressed  in  terms  of  an  integro-differential  operation  on  the 
induced  current.  This  expression  and  the  boundary  conditions  result  in  an  electric  field  integral  equation 
(EFIE)  in  terms  of  the  unknown  surface  currents  on  the  the  elements  [3]. 
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Figure  1:  A  N-Element  Planar  Array 

An  approximate  solution  to  this  EFIE  can  be  obtained  by  employing  the  method  of  moments  [4]. 
A  pulse  expansion  of  the  unknown  current,  and  a  pulse  testing  of  the  EFIE  yields  the  familiar  network 
matrix  equation  for  the  induced  current  cast  explicitly  in  terms  of  the  complex  frequency  parameter 
s  —  0,  +  jo;,  _  _ 

I(s)  =  7.-\s)V{s)  =  Y{s)V(s)  (I) 

where 

Z(s)  is  the  system  impedance  matrix 
Y(s)  is  the  system  admittance  matrix 
I[s)  is  the  system  current  response  vector 
V{s)  is  the  system  voltage  excitation  vector 

The  inverse  of  the  Z{s)  can  be  expressed  as  the  ratio  of  its  adjoint  to  its  determinant.  Thus,  the 
singularities  (poles)  of  the  system  are  those  complex  frequencies  that  force  the  determinant  of  Z{s) 
to  zero.  These  singularities  are  termed  natural  resonance  frequencies  of  the  scatterer  since  they  are 
frequencies  for  which  the  body  can  have  a  response  with  no  excitation.  The  poles  must  lie  in  the  left 
hand  plane  to  ensure  a  decaying  response.  In  addition,  except  for  those  that  lie  on  the  real  axis,  the 
poles  must  occur  in  conjugate  pairs  so  that  a  real  time  response  exists.  Finally,  no  poles  can  lie  on  the 
juj  axis  because  the  body  is  losing  energy  through  radiation. 

Knowing  the  poles,  the  system  admittance  matrix  can  be  expanded  in  a  singularity  series,  hence  the 
name  SEM.  Expanding  Y(s),  the  expression  for  the  induced  current  becomes 

I{s)  =  V  V(s)  +  (2) 

where  is  the  ath  natural  resonance,  and  is  the  residue  matrix  at  the  ath  resonance.  The  term 
lYe(5)  is  a  vector  whose  elements  are  entire  functions,  i.e.,  it  has  singularities  only  at  infinity  [5].  Baum 
addresses  the  inclusion  of  lYe(5)  in  (2)  and  shows  that  it  is  not  necessary  for  the  SEM  characterization 
of  perfectly  conducting  cylinders  [6]. 

The  residue  matrix  Ra  at  can  be  evaluated  via  the  Cauchy  residue  theorem  [7],  which  is 

=  f  Y(s)ds  (3) 

27rj  Jca 
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where  denotes  a  contour  enclosing  the  pole  at  5  =  5^.  It  can  be  shown  that  is  dyadic;  that  is,  its 
elements  can  be  calculated  as  the  product  of  elements  taken  from  two  column  vectors  [5].  Furthermore, 
because  the  EFIE  is  used,  Ra  is  symmetric  which  means  the  column  vectors  are  identical.  Exploiting 
the  dyadic  and  symmetric  properties,  R^  can  written  as 

R„  =  pjijil  (4) 

where  Ma  is  the  ath  natural  modal  current  distribution  and  the  superscript  denotes  the  matrix 
transpose  operation.  The  parameter  Poc  is  a  complex  proportionality  constant  which  is  chosen  so  that 
the  maximum  value  of  Ma  is  real  and  unity. 

Using  the  factored  form  of  R^  and  dropping  the  entire  function,  the  expression  for  the  induced 
currents  becomes  _ ^ 

=  (5) 

The  voltage  excitation  vector  ^(5)  can  be  written  more  specifically  as 

V(5)  =  f(s)Ais)  (6) 

where  f{s)  is  the  functional  form  of  the  system  excitation  (impulse,  step...),  and  yV(5)  is  the  geometry 
dependent  impulsive  excitation  vector.  The  elements  of  A(5)  are  of  the  form  bexp{—ts),  where  b  is 
proportional  to  the  component  of  the  incident  electric  field  vector  along  the  body.  The  term  exp{—ts) 
represents  a  time  delay  of  t  seconds.  This  delay  ensures  causality  in  that  no  point  on  the  body  can 
respond  prior  to  the  arrival  of  the  incident  wavefront  [1]. 

Using  (6)  in  (7),  yields 

a  5  5o: 

The  SEM  parameter  knowm  as  the  coupling  coefficient  appears  in  (7),  and  it  is  defined  as 

Vcc{s)  =  PaM^Ais)  (8) 

This  parameter,  which  is  dependent  on  the  geometry  of  the  scatterer  as  well  as  the  polarization  and 
propagation  direction  of  the  incident  wave,  governs  how  much  the  ath  pole-mode  pair  couples  into  the 
response.  Substituting  (8)  into  (7),  the  current  response  vector  becomes 

=  f{^)  E 

a  ^  -  -So 

To  obtain  an  expression  for  the  charge,  the  continuity  equation  is  employed: 

p(3)  =  _iv-J(.)  (10) 

S 

Combining  the  continuity  equation  with  (9),  the  expression  for  the  charge  becomes 

p{s)  = -f{s)J2vc{s)-^^  =  f{s)f{s)  (11) 


where  r(5)  is  the  “network”  impulse  charge  response  and 

Da  =  V‘Ma 
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Ill  N umerical  Results 

In  this  section,  the  electrostatic  characterization  of  two,  three,  four,  and  five  element  arrays  will  be 
given.  The  elements  of  the  planar  arrays  under  consideration  have  a  radius-to-length  ratio,  a/X,  of 
0.005,  and  a  separation  distance-to-length  ratio,  djL  of  0.10.  The  radius  a  and  the  separation  distance 
d  parameters  are  illustrated  in  Figure  1.  The  electrostatic  charge  distribution  on  these  arrays  using  the 
SEM  wiU  be  compared  to  the  direct  MOM  solution  for  the  same  obtained  from  the  electrostatic  scalar 
potential  equation.  In  addition  to  presenting  the  electrostatic  charge  distribution,  a  discussion  of  some 
relationships  among  the  the  SEM  parameters  of  the  N-element  arrays  will  be  given. 

Before  presenting  the  results  for  the  two,  three,  four,  and  five  element  arrays,  it  is  interesting  to  give  a 
brief  discussion  of  the  SEM  parameters  associated  with  a  single  linear  conductor.  A  SEM  characterization 
of  a  single  linear  element  reveals  that  the  poles  of  this  object  occur  in  layers  roughly  parallel  to  the  ju; 
axis.  The  poles  in  the  fundamental  layer  lie  very  close  to  the  juj  axis.  Only  these  poles  contribute 
significantly  to  the  response  since  their  nearness  to  the  jw  axis  imphes  a  small  damping  constant.  The 
imaginary  parts  of  the  fundamental  layer  poles,  when  multiplied  by  the  length  of  the  element  L  and 
divided  by  the  speed  of  light  c,  occur  at  approximately  Ajtt,  where  k  denotes  an  integer.  Thus,  the 
first  pole  of  the  first  layer  Hes  near  uLjc  =  tt.  Singularities  residing  in  this  general  vicinity  are  termed 
fundamental  resonance  region  poles.  Other  poles  in  the  layer  nearest  the  imaginary  axis  are  referred  to 
as  harmonics  of  the  fundamental  and  have  imaginary  parts  near  27r,  Stt,  etc.  Another  way  to  view  the 
location  of  the  first  layer  poles  is  to  note  their  imaginary  parts  occur  at  frequencies  where  the  element 
length  is  an  integer  multiple  of  a  half- wavelength.  This  relationship  is  shown  as  follows 

=  ^  L^k\  (17) 

c  A  2. 

where  A  is  the  free  space  wavelength. 
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As  mentioned  previously,  associated  with  each  pole  is  a  unique  modal  current  distribution.  The  modes 
for  poles  lying  in  the  first  layer  have  sinusoidal  current  distributions  along  the  length  of  each  element.  The 
fundamental  resonance  region  pole  is  observed  to  have  a  half- wavelength  sinusoidal  current  distribution 
as  would  be  expected  from  (17).  Similarly,  second  harmonic  poles  have  a  full  wave  sinusoidal  current 
distribution.  Although  (17)  may  reveal  the  nature  of  the  distribution  for  a  given  harmonic,  what  is  not 
so  obvious  is  the  direction  of  the  current  flow  on  the  array  associated  with  a  given  mode.  Examining  the 
location  of  the  poles  and  the  characteristics  of  their  corresponding  modal  current  distributions  reveals 
much  about  the  fundamental  electromagnetic  interaction  process. 

Having  introduced  the  SEM  parameters  for  the  single  element,  the  pole-mode  relationships  for  the 
two,  three,  four,  and  five  element  arrays  can  now  be  discussed.  The  SEM  factorization  of  the  two  element 
array  reveals  that  two  poles  reside  in  the  general  location  of  the  complex  plane  where  the  isolated  element 
has  a  single  pole.  Thus,  in  the  fundamental  resonance  region,  the  two  element  array  has  two  poles. 
Furthermore,  two  poles  exist  in  each  of  the  higher  order  harmonic  regions  as  well.  Figure  2a  illustrates 
the  pole  positions  for  the  first  five  harmonics  of  the  two  element  array.  As  one  might  anticipate,  the 
three  element  arrays  has  three  poles  in  the  complex  plane  where  the  single  element  has  only  one.  The 
pole  positions  for  the  three  element  array  are  shown  in  Figure  2b.  This  trend  continues  as  more  elements 
are  added  to  the  array  such  that  a  N-element  array  has  N  poles  in  the  region  where  the  isolated  element 
only  has  one. 
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Figure  2:  The  pole  positions  for  the  first  five  harmonics  of  the  (a)  two  element  array  and  the  (b)  three 
element  array. 

The  addition  of  a  singularity  for  each  element  added  to  the  array  provides  for  some  interesting  pole 
mode  relationships.  The  two  poles  for  the  two  element  array  can  be  identified  as  either  a  radiation  pole 
or  a  transmission  line  pole  depending  on  the  directions  of  the  modal  currents  which  flow  on  the  elements 
of  the  arrays.  The  radiation  pole,  designated  with  the  symbol  *  in  Figure  2a,  has  modal  currents  that 
flow  in  the  same  direction  on  both  elements.  This  pole,  which  is  located  furthest  from  the  ju  axis,  is  low 
Q  (essentially  the  ratio  of  stored  energy  to  dissipated  energy);  therefore,  any  energy  that  is  coupled  into 
this  mode  will  be  rapidly  radiated  into  space.  In  contrast,  the  transmission  line  pole  has  modal  currents 
that  flow  in  opposite  directions.  In  accordance  with  Kirchoff’s  current  law,  a  symmetric  two  wire  array 
with  one  ampere  flowing  on  one  element  must  have  one  ampere  flowing  in  the  opposite  direction  on  the 
other  element.  The  transmission  line  pole,  designated  with  the  symbol  X,  is  located  very  close  to  the  ju 
axis  and,  therefore,  is  high  Q  (low  damping  constant).  The  real  part  of  the  modal  current  distributions 
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associated  with  the  fundamental  radiation  and  transmission  line  poles  are  shown  in  Figure  3.  The 
imaginary  part  of  the  modal  current  is  not  shown  since  it  is  very  small  relative  to  the  real  part. 


Figure  3:  The  real  modal  current  distributions  for  the  fundamental  radiation  and  transmission  line  poles 
of  the  two  element  array. 

The  three  element  array,  as  mentioned  previously,  has  three  poles  in  the  area  where  the  isolated 
element  has  only  one.  By  inspecting  Figure  2b,  one  can  observe  the  positions  of  these  singularities.  The 
three  element  array  also  has  a  radiation  pole  and  a  transmission  line  pole  designated  by  the  symbols 
*  and  X,  respectively.  These  poles  have  the  same  characteristics  of  the  radiation  and  transmission  line 
poles  of  the  two  element  array.  Of  the  pole  trio,  the  radiation  pole  resides  furthest  from  the  ju;  axis 
and  the  transmission  line  pole  lies  the  closest.  The  radiation  pole  has  modal  currents  that  flow  in  the 
same  direction  on  all  three  elements.  Furthermore,  it  has  the  highest  damping  and  lowest  Q  of  the  three 
poles.  The  transmission  line  pole  has  modal  currents  of  0.5  amperes  (amplitude)  flowing  in  the  same 
direction  on  the  outer  wires  and  one  ampere  flowing  in  the  opposite  direction  on  the  the  center  element, 
which  satisfies  KirchofF’s  current  law.  Because  of  its  close  proximity  to  the  ju  axis,  the  transmission  line 
pole  has  a  small  damping  constant  and,  therefore,  a  high  Q.  The  remaining  pole  is  referred  to  here  as 
the  zero-mode  pole.  As  its  name  suggests,  the  zero-mode  pole  has  no  current  on  the  center  element.  In 
Figure  2b,  this  pole  is  designated  with  the  symbol  0.  The  real  parts  of  the  modal  current  distributions 
of  the  the  fundamental  radiation,  transmission  line,  and  zero-mode  poles  of  the  three  element  array  are 
shown  in  Figure  4.  It  should  be  pointed  out  here  that  the  poles  of  the  arrays  are  system  poles;  that  is, 
a  pole  does  not  correspond  to  any  particular  element  of  the  array. 

The  four  and  five  element  arrays  possess  some  of  the  modes  of  the  two  and  three  element  arrays  as 
well  as  having  some  unique  modes  of  their  own.  As  expected,  the  four  element  array  has  four  poles  in  a 
given  harmonic  region  which  means  it  has  four  distinct  modal  distributions.  Two  of  these  poles  are  the 
familiar  radiation  and  transmission  line  poles.  These  poles  have  the  same  characteristic  modal  current 
distributions  as  discussed  previously.  The  radiation  pole  has  modal  currents  flowing  in  the  same  direction 
on  each  element,  and  relative  to  the  positions  of  the  other  three  poles,  it  is  located  furthest  from  the  jco 
axis.  An  inspection  of  the  modal  current  distribution  for  the  transmission  line  pole  reveals  one  ampere 
flowing  in  opposing  directions  on  the  inner  two  elements  and  0.5  amperes  flowing  in  opposing  directions 
on  the  outer  elements.  The  other  two  distinct  modal  current  distributions  of  the  four  element  array  are 
interesting,  but  a  physical  interpretation  is  not  so  simple.  The  current  distributions  of  these  modes  are 
oriented  such  that  the  net  current  flowing  over  all  four  elements  is  zero. 

As  one  would  expect,  the  five  element  array  has  five  distinct  modes.  Three  of  these  are  associated 
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-  Transmission  Line 
•  -Zero-Mode 
—Radiation 


Figure  4:  The  real  modal  current  distributions  for  the  fundamental  radiation,  transmission  line,  and 
zero-mode  poles  of  the  three  element  array. 


with  the  three  element  array.  Thus,  in  addition  to  two  more  singularities,  the  five  element  array  has 
a  radiation  pole,  a  transmission  line  pole,  and  a  zero-mode  pole.  These  three  poles  have  the  same 
characteristic  modal  distributions  described  earlier  for  the  three  element  array.  A  physical  interpretation 
of  the  modes  of  the  remaining  two  singularities  is  also  difficult.  One  significant  feature  of  these  modes  is 
that  their  distributions,  when  summed  over  all  five  elements,  is  approximately  zero. 

As  mentioned  in  the  previous  section,  the  coupling  coefficient  governs  how  much  of  each  mode  couples 
into  the  response.  The  electrostatic  coupling  coefficient  for  the  two  and  three  element  arrays  as  a  function 
of  the  angle  of  incidence  9  is  shown  in  Figure  5.  The  angle  of  incidence  and  polarization  of  the  incident 
electric  field  is  defined  in  Figure  1.  By  observing  Figures  5a  and  5b,  one  can  see  that  only  the  odd 
harmonics  of  the  radiation  pole  couple  significantly  into  the  response.  Furthermore,  the  amount  each 
radiation  pole  couples  into  the  response  decreases  with  the  increase  in  the  order  of  the  harmonic.  No  mode 
significantly  couples  into  the  response  beyond  the  7th  or  9th  harmonic.  The  strong  coupling  aspect  of 
the  radiation  pole  is  due  to  the  fact  that  its  modal  currents  flow  in  the  same  direction  on  all  the  elements. 
None  of  the  modes  of  the  even  harmonics  couple  into  the  response  because  their  distributions  have  an  odd 
symmetry  about  the  center  of  each  element,  and  the  excitation  is  an  even  function.  Furthermore,  except 
for  the  radiation  pole,  the  modes  of  all  the  other  singularities  are  oriented  such  that  they  effectively  cancel 
one  another.  As  an  example,  observe  the  modal  current  distribution  associated  with  the  zero-mode  pole 
of  the  three  element  array  in  Figure  4.  It  is  obvious  that  the  currents  on  the  outer  elements  cancel 
each  other,  and  the  current  on  the  center  is  already  zero.  A  similar  observation  can  be  made  about  the 
transmission  line  mode. 
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Figure  5:  The  magnitude  of  the  normalized  electrostatic  coupling  coefficients  for  the  (a)  two  and  (b) 
three  element  arrays.  The  notation  r*  represents  the  harmonic  of  the  radiation  pole. 


The  electrostatic  charge  distributions  for  the  two,  three,  four,  and  five  element  arrays  are  shown  in 
Figure  6,  7,  8,  and  9,  respectively.  The  electrostatic  responses  were  calculated  using  a  unit  step  plane 
wave  polarized  in  the  positive  2-direction  {0  =  90°).  Only  the  fundamental,  third,  and  fifth  harmonics 
were  used  in  computing  the  responses.  The  SEM  electrostatic  responses  are  compared  to  the  direct 
MOM  solution  of  the  electrostatic  scalar  potential  equation.  As  one  can  see,  the  SEM  agrees  well  with 
the  direct  MOM  solution  in  all  cases. 

IV  Conclusions 

In  this  paper,  the  Singularity  Expansion  Method  was  used  to  determine  the  electrostatic  charge  distri¬ 
bution  on  a  N-element  planar  array.  The  results  obtained  using  the  SEM  agreed  very  well  with  the  more 
conventional  MOM  solution  of  the  electrostatic  scalar  potential  equation.  In  addition  to  yielding  good 
results,  the  SEM  provided  a  more  meaningful  description  of  the  electromagnetic  interaction  process. 
This  was  accomplished  by  factoring  the  problem  into  its  SEM  parameters:  poles,  modes,  and  coupling 
coefficients. 

Through  numerical  experimentation  on  planar  arrays  of  two,  three,  four,  and  five  elements,  some 
interesting  relationships  among  the  SEM  parameters  were  observed.  Based  upon  these  results,  the 
following  conclusions  can  be  made  for  a  N-element  (N>2)  array: 

•  A  N-element  array  has  N  poles  in  the  region  where  a  single  element  has  only  one.  These  poles  are 
system  poles  and  are  not  related  to  a  single  element  in  the  array.  Associated  with  each  pole  is  a 
unique  modal  distribution. 

•  Two  of  the  N  poles  will  be  a  radiation  pole  and  a  transmission  line  pole.  Of  the  N  poles,  the 
radiation  pole  will  be  located  the  furthest  from  the  ju  axis,  and  the  transmission  line  pole  will  be 
closest. 

•  All  odd  N-element  arrays  will  have  a  zero- mode  pole. 

•  A  N-element  array  will  have  all  the  modes  of  the  N-2  array  plus  2  additional  distinct  modes. 

•  The  fundamental  radiation  pole  dominates  the  electrostatic  response. 
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Figure  6:  The  electrostatic  charge  distribution  for  the  two  element  array,  d/L  =  0.10,  a/L  =  0.005.  The 
applied  electric  field  is  oriented  in  the  positive  z- direction  (0  -  90  deg.). 


Figure  7:  The  electrostatic  charge  distribution  for  the  three  element  array,  d/L  =  0.10,  a/L  =  0.005. 
The  applied  electric  field  is  oriented  in  the  positive  z-direction  [6  =  90  deg.). 
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Introduction.  VIC-3D®  [1],  a  volume-integral  code  for  eddy-current  nondestructive  evaluation  (NDE), 
solves  the  problem  of  computing  the  interaction  between  flaws  and  probe  coils  (see  Figure  1)  [2],  [3].  The 


Figure  1:  Electromagnetic  NDE:  showing  a  probe  coil  interacting  with  a  flaw. 

probes  may  have  either  ferrite  or  air  cores  [4],  [5],  and  the  flaw  can  exist  in  either  ferromagnetic  or  non- 
ferromagnetic  plane- parallel,  layered  workpieces.  Because  the  fields  are  usually  quite  small,  we  can  ignore 
the  nonlinear  magnetization  characteristics  of  the  workpiece,  and  assign  a  scalar  magnetic  permeability 
to  each  layer. 

Volume- Integral  Equations  for  Ferromagnetic  Workpieces.  We  start  with  Maxwell’s  equations 

V  X  E  =  —juB 

V  X  H  =  ju)D  +  j('>  .  (1) 

Now,  H  =  B//i(r)  =  +  B/ft(r)  -  B//J/,  =  B/nh  —  Ma,  where  fih  is  the  host  conductivity,  and 

Ma  is  the  anomalous  magnetization  vector.  Thus,  the  second  of  Maxwell’s  equations  may  be  written 

V  X  B//tfc  =  juD  +  J(')  +  V  X  Ma  ,  (2) 

which  makes  clear  that  the  Amperian  current,  V  x  Ma,  is  an  equivalent  anomalous  electric  current  that 
arises  because  of  the  departures  of  the  magnetic  permeability  of  the  workpiece  from  the  host  permeability, 
on  the  other  hand,  is  an  electric  current  that  includes  the  anomalous  current  that  arises  due 
to  differences  in  electrical  conductivity;  =  ahE  +  (o-(r)  -  a h)E  =  4-  Ja,  Because  the  host 

conductivity  and  permeability  are  constant  within  each  plane-parallel  layer,  they  can  be  accounted  for 
by  means  of  Green  functions.  This  leaves  us  with  only  the  anomalous  electric  and  magnetic  sources  to 
be  determined. 

Even  though  the  Amperian  current  is  electrical  (because  it  appears  as  a  source  term  in  the  second 
Maxwell  equation  (Ampere’s  law)),  we  will  refer  to  it  as  to  remind  us  that  it  is  of  magnetic  origin, 
and  to  distinguish  it  from  (which  now  stands  for  the  anomalous  electric  current,  Ja).  The  important 
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point,  however,  is  that  because  the  Amperian  current  behaves  as  an  electrical  current,  we  need  only  use 
electric-electric  Green  functions  in  the  formulation  of  the  problem. 

In  establishing  the  volume-integral  equations,  we  simply  make  use  of  the  fact  that  the  total  electric 
field  and  magnetic  flux  density  at  a  point  is  the  sum  of  the  fields  due  to  the  probe  coil,  which  we  call  the 
incident  fields,  and  those  due  to  the  anomalous  currents,  and 
Hence,  we  write 

EW(r)  =  -  E(“)(r)  [j(')]  -  EW(r)  [jW]  -  E<°)(r)  [j*”*)]  -  E(")(r)  [jt")] 

+^V  X  EW(r)  [j!”")]  +  iv  X  E(^)(r)  [jf”)]  .  (3) 

In  arriving  at  the  second  equation,  we  have  used  the  fact  that  B  =  -■(l/ja;)V  x  E,  and  - 

The  first  part  of  the  first  equation  in  (3)  is  the  electric- electric  (ee)  interaction,  and  the  second 
part  is  the  electric- magnetic  (em)  interaction.  The  two  parts  of  the  second  equation  are,  respectively, 
the  magnetic-electric  (me)  and  the  magnetic-magnetic  (mm)  interactions.  We  decompose  the  various 
interactions  into  the  ‘infinite- space’  part,  designated  by  the  superscript,  (0),  and  the  ‘layered- space’  part, 
designated  by  the  superscript,  ($).  This  is  done  for  convenience  in  coding  and  problem  solving. 

The  electric-electric  interaction  terms  are  given  by 

E(°)(r)[j('>]  =  J r')jW(rVr'+  J -  r')V'  •  jW(r')*'  ,  (4) 

and 

E(’)(r)[J<')]  =  J  -x',y-  y',  z  -  +  ^2  J  dT'G^“}(x  -x',y-  y',  z  -  z')4‘\i') 

+  J  dr'GW(‘)(i  -x',y-  y',  z  +  +  Si,  J  dr'Gg^x  -x',y-  y',  z  +  2') 4“)(r') 

+  V,  I  -x',y-  y',  z  -  2')V;  •  jW(r') 

+  VtJ  dT'G^^py{x  -x',y-  y',  z  +  z')V[  ■  jW(r') 

+  JdT’Gii{x-x',y-y',z-z')V'.3(‘)(r') 

-  -x',y-  y',  z  +  2')V'  •  J<')(r')  .  (5) 

The  electric-magnetic  interaction  terms  are  given  by  substituting  for  j(®)  in  (4)  and  (5);  because 
has  zero  divergence,  we  have 

E(o)(r)[j('")]  =  y$W(r-r')J<'")(r')*' 

EM(r)[j(-)]  =  J  dr'GW(“)(x  -x',y-  y',  z  -  2')J^>(r') 
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+  a,  y  dv'GtX^  -x',y-  y',  z  -  z')jXHi') 

+  J  dr'G^J-X'‘\x  -  s',  y  -  y',  z  +  2')jS”‘'(r') 

+  y  dT'G^'Xi,!  -  s',  y  -  y',  2  +  z')jX\r') 

+  V,  y  *'gP)(‘‘)'(s  -  s',  y  -  y',  2  -  2')y;  •  j'^V') 

+  V,  y  dr'GW(‘)'(s  -  s',  y  -  y',  2  +  2')V;  •  J<”>(r')  , 


,  _  d  d  dJx  ,  dJy  T  ,  7  A 

where  Vt  =  •  J  =  ;  Jf  =  4*  a^Jy,  and 


47r|r  -  r^l 

roo 


In  the  Bessel  transform,  r  =  [(x  —  x^Y  +  (3/  —  2/0^]^^^* 

The  various  layered- space  Green  functions  are  given  by 


27r  7o  2A0 


G(lK<-)'(s-s',y-y',2-2')  =  f 


G(i)W'(s-s',y-y',2  +  2')  =  ^ 


2Ao  P 


Jo{rl)ldl 


GM(s-s',y-y',2-2') 


^  /■°°  rc(s'-s)^_2A„ 


Jo{rl)ldl  ,  (8) 

where  c(,j'  —  2:)  =  Aq  =  (/^  —  ^  is  the  thickness  of  the  0th  layer,  which 

contains  the  flaw. 

Note  that  the  terms  with  the  superscript  (a)  are  convolutional  in  all  three  spatial  variables,  whereas 
those  with  (6)  are  convolutional  in  (x  -x')  and  (y  —  y^)^  but  are  correlational  in  {z-\-z*),  where  z  is  normal 
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to  the  layers  of  the  workpiece.  This  decomposition  is  of  great  importance  when  solving  large  systems  of 
equations,  as  we  shall  see  later. 

The  G’s  are  defined  in  terms  of  the  TE  and  TM-mode  reflection  coefficients; 


= 


<^1,-1  - 


(E)  r,(£) 


Kf’R 


(M)p(M) 


g(^)  = 


RY  ’R\ 


gW  = 


4^ 


■2AoT 


gL?  = 


j(E) 

^-1 


l-4")4">e-2Aor’ 


1  -  ij5®>ilL®’e-2AoT 


(9) 


which  are,  in  turn,  computed  using  a  recursion  relation  for  stratified  media  [6]. 
The  magnetic-electric  operators  are 


V  X  E(°)(r)  [j(')] 

= 

Vx  y#(')(r- r')j''>(i 

:')dr' 

V  X  E("l(r)  [jt'>] 

= 

V  X  y  <ir'GW(“)(s  -  x'. 

,s-z')jW(r') 

4- 

V  X  a,  y  rfr'GW(i  -  x' 

,y-y' 

,z-z')jW(r') 

+ 

V  X  y  -  x', 

>y-y\ 

z  +  z')jS')(r') 

+ 

rfr'Gg(®  -  s' 

,y-y' 

,z  +  z')Ji'>(r') 

+ 

V  X  V(  y  dr'Gi^j  <“>'(!  - 

-^\y- 

-j/',z-z')V;. 

4- 

dr'GW<*')'{x  - 

-^\y- 

■y',z  +  z')Vf. 

with  the  same  expressions  holding  for  V  x  Ef°^(r)  ?  V  x  E^®)(r) 

Discretization  via  the  Method  of  Moments  (Galerkin)  Define  a  regular  grid  in  three-dimensional 
space,  with  grid  spacing  6x^  6y^  Sz.  Relative  to  this  grid  we  define  7r(x)  to  be  the  unit  pulse 


7r{x)=z 


{ 


1, 

0, 


if  0  <  z  <  1 
otherwise. 


(11) 


and  Trjn+i(x)  to  be  the  mth-order  convolution  of  7r(ar)  (we  define  7ri(z)  =  7r(z)). 
Next,  expand  the  current  vector  as 


Jx{r)  — 

KLM 

Jy{r)  = 

Mr) 


E 

KLM 


jiz)  ji(^)(e)/  \  . 

'^KLM^KLM\r}  j 


(12) 
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the  expressions  for  are: 

=  T^2k{xl^3:)T:xi{ylSy)'Kijri{zl6z) 

=  '^\k(^l^^)'^2l{ylhyim{zldz) 

'^kfm\^)  =  T^\k{xl^xyn{yl6y)T:2m(zl8z)  (fc,;,m)  =  (0, 0,0),. .  .,(iVa;,  JVy,iV;s)  ,  (13) 

where  T^im{yl^y)  is  the  mth  unit  pulse  function,  and  '!^2k{xlbx)  is  the  Arth  tent  function,  which  is  the 
convolution  of  7ri;t(x/^x)  with  itself. 

The  TW(')(r)  are  called  facet  elements,  because  the  qth.  element  is  constant  over  the  ^th  facet  of  the 
klmth  cell.  They  are  used  to  ensure  that  the  divergence  of  the  current  density  remains  bounded.  Facet 
elements  have  been  called  ‘volumetric  rooftop’  functions  in  [7].  Volumetric  rooftop  functions  have  also 
been  used  in  [8]  and  [9]. 

Because  J^’”^(r)  =  V  x  Ma(r),  we  expand  Ma(r)  in  edge-elements,  which  have  the  required  differen¬ 
tiability  of  the  curl  operation 

KLM 

KLM 

KLM 

where 

'^KLm\^)  =  ^lK{x)ir2L{y)T^2M{z) 

=  ^2K(x)riL(y)T^2M(z) 

^KLm\^)  =  T^2K{x)Tr2Liy)^lM{z)  •  (15) 


These  functions  are  called  edge-elements  because  the  expansion  coefficient,  (constant) 

value  of  Mx  along  the  x-directed  edge,  (y  =  (£  -|-  l)Sy,  z  =  (M  +  l)Sz).  There  are  similar  interpretations 
for  and 

The  components  of  the  magnetic  current  vector  are  given  by 

/(-)  „  dMy 

dy  dz 

=  Y1  [^pLM'^'2K{xy2L(.y)-^\M{z)  -  M^j^l,^T2K(x)TriL{yy2M{^)] 

KLM 

T(m)  _  dM^ 

^  dz  dx 

=  Y.  [^pLM^lK{x)T^2L(yy2M(^)  “  ^KLM^2K(^yL{yyM(z)\ 

KLM 

7(m)  _  dM^ 

dx  dy 
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=  -  MP[^^nK(xy2dy)T^2Miz)] 

KLM 

V  .j("^)  = 

^  ^  dydz  dxdz 

KLM 

where  the  last  term  is  the  transverse  divergence  of  the  Amperian  current. 

Note  that  we  can  express  in  terms  of  i.e.,  the  edge  elements  and  the  facet  elements  are 
related.  This  simplifies  a  number  of  calculations  involving  ferromagnetics. 

We  will  discretize  (3)  by  employing  Galerkin’s  method,  which  uses  the  same  vector  functions  for 
expansion  and  testing-  The  spatial  derivatives  that  could  cause  problems  will  be  removed  by  the  testing 
process.  Take  moments  of  each  of  the  first  three  equations  of  (3)  with  the  corresponding  facet  element, 
and  of  the  second  three  equations  with  the  corresponding  edge  element.  The  result  for  the  electric 
equation  is 


■  ‘ 

O 

O 

(ee) 

■  j(^)  ■ 

0  0 

j(y) 

e(*^) 

0  0  qW 

j(^) 

-f 

"I 

(ee) 

‘  j(^)  ’ 
j(l/) 

qU) 

*^(0) 

^(0)  . 

. 

G<:f 

^(xz)  -| 

(«) 

■  j(^)  ■ 

+ 

j(v) 

Gf-i) 

gU) 

qU) 
^(a)  J 

.  . 

+ 

r 

1 

(ee) 

j(y) 

gIv) 

^(6) 

qU) 
^(6)  J 

G(o)  G(„) 


G(j) 


■  (em) 


M(^) 


(17) 


where  the  Q’s  are  tri-diagonal  matrices,  the  G(o)’s  the  infinite- space  matrices,  the  G(a)’s  the  convolu¬ 
tional  layered-space  matrices,  and  the  G(j,)’s  the  correlational  layered-space  matrices.  The  infinite-space 
matrices  are  convolutional,  also.  The  superscript  (ee)  denotes  electric- electric  matrices,  and  (em)  denotes 
electric-magnetic  matrices.  The  J’s  are  the  unknown  electric  currents,  and  the  M’s  are  the  unknown 
magnetic  polarization  vectors.  The  last  block  in  (17)  is  simply  a  short-hand  representation  of  the  three 
blocks  above  it,  except  that  it  represents  electric- magnetic  interactions. 

The  magnetic  equation  is  similar  to  (17),  and  is  given  by 


b(*^)  ■ 

■  QW  0  0  1 

(mm) 

■  ■ 

B(*y) 

= 

O 

O 

M(v) 

b(*^) 

o 

o 

mw 
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3E)  5S)  ,,8) 

^  5“i,  53,  °a 

L  ^(0)  ^(0)  ^(0)  J 

r  g(^^^  *i(^”^) 


!“\  T\  T\ 

Gkf  gm  g  ::f 

Mzx)  Mzy)  Mzz) 

^(a)  ^(a)  ^(a)  J 

p(xr)  Mxy)  „(xz)  1  ("■”*) 

at,  5?„  l& 

5^)  3?.)  S") 


G(0)  G(a)  G(j,) 


where  B  is  the  incident  magnetic  flux  density  due  to  the  coil,  the  superscript  (mm)  stands  for  magnetic- 
magnetic  interactions,  and  (me)  stands  for  magnetic- electric  interactions.  The  magnetic-magnetic  Q 
matrices  are  a  little  more  complicated  than  the  electric-electric  ones.  The  important  point  is  that  the 
electric- magnetic,  magnetic-electric,  and  magnetic-magnetic  matrices  can  he  computed  from  the  electric- 
electric. 

We  take  advantage  of  the  convolutional  and  correlational  structure  of  the  matrices  by  using  con¬ 
jugate  gradients  to  solve  the  system  of  equations.  The  vector-matrix  products  are  evaluated  using 
three-dimensional  FFT’s  [4]. 

Impedance  of  Flaws  in  Ferromagnetic  Bodies.  The  reaction  of  field,  on  source,  is 


[1,2]  =  1 4’ 


j(i)  . 


The  source  with  superscript  1  is  the  primary  source  due  to  the  exciting  coil,  and  superscript  2  denotes 
scattered  fields  (and  their  sources)  due  to  the  flaw.  If  Ic  is  the  current  in  the  exciting  coil,  then  the 
change  in  impedance  due  to  the  flaw,  as  seen  by  the  coil  is 

AZ  =  =  Jll^  ,  (20) 

■*C  •‘c 

where  we  have  used  the  reciprocity  theorem.  If  we  normalize  the  excitation  to  be  /c  =  Ij  tlien 


AZ  =  -EWdy 

=  -y(J  +  VxM)-EW<iF 

=  -  /  J  •  -  y  M  ■  V  X  E(’)<iy 

=  -  y  J  •  E<‘>dF  +  y  M  •  ju)B^''>dV  , 


where  we  have  dropped  the  superscript  2,  and  replaced  the  superscript  1  by  (z),  to  denote  incident  fields. 
The  transference  of  the  curl  operator  in  going  from  the  second  to  the  third  equation  is  valid  for  M  with 
finite  support. 
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Upon  substituting  tbe  expansions  for  the  electric  and  magnetic  solution  vectors,  (12)  and  (14),  into 
(21),  we  get 


AZ 


=  -E 

KLM 


[rW 

'^KLM 


(y)  p{iy)  ,  t(^) 

KLM^KLM  + 


r(*^) 

KLM^KLM 


-7W 


y^^KLM^KLM  +  ^KLM^KLh 


l<r  T .\4  ^  T^■  T  » 


■)]■ 


(22) 


This  is  a  sum  of  dot-products  of  the  electric  current  and  magnetic  polarization  solution  vectors  with  the 
incident  electric  field  and  magnetic  flux-density  vectors. 


References 

[1]  VIC-3D:  An  Eddy-Current  NDE  Code  User’s  Guide,  Sabbagh  Associates,  Inc.,  4635  Morningside 
Drive,  Bloomington,  IN  47408,  September  1993. 

[2]  H.  A.  Sabbagh,  L.  D.  Sabbagh,  and  J.  R.  Bowler,  “A  Volume- Integral  Code  for  Eddy-Current 
Nondestructive  Evaluation,”  COMPEL,  Volume  9  (1990),  Supplement  A,  pp.  67-70. 

[3]  J.  R.  Bowler,  L.  D.  Sabbagh,  and  H.  A,  Sabbagh,  “Eddy-Current  Probe  Impedance  due  to  a  Surface 
Slot  in  a  Conductor,”  IEEE  Trans.  Magnetics,  Vol.  26,  No.  2,  March  1990,  pp.  889-892. 

[4]  J.  R.  Bowler,  L.  D,  Sabbagh,  and  H.  A.  Sabbagh,  “A  Theoretical  and  Computational  Model  of  Eddy- 
Current  Probes  Incorporating  Volume  Integral  and  Conjugate  Gradient  Methods,”  IEEE  Trans. 
Magnetics,  Vol.  25,  No.  3,  May  1989,  pp.  2650-2664. 

[5]  H.  A.  Sabbagh,  J.  R.  Bowler,  and  L.  D.  Sabbagh,  “A  Model  of  Eddy-Current  Probes  with  Ferrite 
Cores,”  Nondestr.  Test.  Eval.,  Vol.  5,  pp.  67-79. 

[6]  G.  J.  Burke,  C.  G.  Dease,  E.  M.  Didwall,  R.  J.  Lytle,  “Numerical  Modeling  of  Subsurface  Commu¬ 
nication,”  UCID-20439  Rev.  1,  Lawrence  Livermore  National  Laboratory,  August  1985. 

[7]  M.  F .  Catedra,  E.  Gago,  and  L.  Nuno,  “A  Numerical  Scheme  to  Obtain  the  RCS  of  Three- 
Dimensional  Bodies  of  Resonant  Size  Using  the  Conjugate  Gradient  Method  and  the  Fast  Fourier 
Transform,”  IEEE  Trans.  Antennas  and  Propagation,  Vol.  37,  No.  5,  May  1989,  pp.  528-537. 

[8]  A.  Peter,  M.  Zwamborn,  P.  M.  van  den  Berg,  J.  Mooibroek,  and  F.  T.  C.  Koenis,  “Computation  of 
Three-Dimensional  Electromagnetic- Field  Distributions  in  a  Human  Body  Using  the  Weak  Form  of 
the  CGFFT  Afethod,”  Applied  Computational  Electromagnetics  Society  Journal,  Winter  1992  Vol 
7,  No.  2,  pp.  26-42. 

[9]  P.  Zwamborn  and  P.  M.  van  den  Berg,  “The  Three-Dimensional  Weak  Form  of  the  Conjugate  Gradi¬ 
ent  FFT  Method  for  Solving  Scattering  Problems,”  IEEE  Trans.  Microwave  Theory  and  Techniques, 
Vol.  40,  No.  9,  September  1992,  pp.  1757-1766. 


116 


SFSSrON3: 

RESEARCH  AND  ENGINEERING 
FRAMEWORK  FOR  CEM 


Chair:  K.  Siarkiewicz 


RESEARCH  AND  ENGINEERING  FRAMEWORK  (REF)  FOR 
COMPUTATIONAL  ELECTROMAGNETICS* 

B.  Hantman§,  K.  Siarkiewicz^,  J.  Labelle§,  R.  Jackson^ 

Correspondence  to: 

Barry  Hantman 

Raytheon  Company,  Mailstop  T3IV1J21 
Tewksbury,  MA  01876 
Phone:  (508)  858-5778 
FAX:  (508)  858-5976 
Email:  hantman@msmg02.msd.ray.com 


MMACE  /  A  TRI-SERVICE  PROGRAM 


ABSTRACT 

Scientific  and  engineering  research  and  design  codes  are  typically  FORTRAN-based,  having  vastly 
different  data  structures  and  I/O,  and  unique  and  inconsistent  user  interfaces  resulting  in  long  learning 
curves  and  difficulties  in  usage.  This  is  true  of  the  codes  implementing  the  various  computational 
electromagnetics  (OEM)  formulations.  The  result  is  numerous  opportunities  for  the  introduction  of  human 
error  in  the  design  process  because  of  usage  difficulties,  the  required  manual  manipulation  of  data  from 
code  to  code,  the  inability  to  graphically  view  the  geometry  and  directly  access  the  geometry  to  build  the 
analysis  model,  and  the  inability  to  ensure  that  all  design  codes  are  working  with  valid  input  data.  This 
situation  also  adversely  affects  the  cost  (human  and  software/hardware)  of  maintaining  and  extending 
design  capabilities  as  the  theoretical  foundation  and  design  needs  advance.  The  Research  and 
Engineering  Framework  (REF),  developed  as  part  of  the  Microwave  and  Millimeter-Wave  Advanced 
Computational  Environment  (MMACE)  program,  provides  a  coherent,  heterogeneous  design  environment 
which  is  the  first  step  toward  addressing  the  above  problems.  Although  the  REF  was  initially  developed  to 
address  the  problems  of  the  power  tube  industry,  the  underlying  framework  is  open  in  the  sense  of 
accommodating  a  broad  range  of  scientific  and  engineering  disciplines,  including  CEM  modeling  and 
simulation.  The  REF  focuses  on  providing  enabling  technology  in  several  critical  areas  including;  ease  of 
use  and  consistency;  support  for  a  "master"  geometry:  data  sharing;  integration  of  codes  at  different 
dimensionalities;  heterogeneous  networking;  numerical  grid  generation;  visual  connection  of  tools  and 
data  flows;  and  configuration  management  of  design  data  for  project  teams. 

The  REF  was  first  released  to  the  US  microwave  tube  industry  In  January  1994  for  evaluation  and  testing. 
The  lessons  learned  from  development  and  usage  have  been  used  to  refine  the  system  and  expand  on  its 
capabilities.  The  REF  is  currently  being  used  in  industry,  universities,  and  government  laboratories. 

This  paper  provides  detail  to  the  concepts  outlined  in  previous  ACES  symposia  [1-2]  and  other 
conferences  [3].  It  contains  an  overview  of  the  REF  architecture,  its  functionality  and  its  capabilities.  Also 
included  is  a  description  of  how  the  REF  can  be  applied  to  the  codes  and  requirements  of  the  CEM 
community. 
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BACKGROUND 


The  MM  ACE  program  has  been  underway  for  over  two  years  and  is  now  in  its  second  phase.  The  first 
phase,  which  began  in  1992,  was  an  investigation  phase.  Four  teams  were  put  in  place  to  study  the 
problems  of  the  Microwave  Power  Tube  industry  and  develop  roadmaps  for  the  implementation  of  a 
framework  to  address  those  problems.  The  teams  found  an  existing  design  environment  that,  was  based 
largely  on  in-house  developed  software;  was  largely  unsupported  by  commercial  vendors;  did  not  take 
advantage  of  the  latest  advances  in  CAD,  visualization,  database  technology,  and  networking;  was 
addressing  technically  difficult  problems;  contained  a  collection  of  tools  that  were  not  well  integrated  with 
each  other;  and  was  crucial  to  projects  deemed  important  to  the  US  government.  While  these  items  were 
true  of  the  power  tube  industry,  the  same  can  be  said  of  many  niche  scientific  disciplines  including  CEM. 
Once  the  investigation  was  complete,  the  four  MMACE  teams  began  constructing  a  prototype  system  to 
experiment  with  some  of  the  proposed  solutions.  This  prototype  was  released  in  early  1994  and  the 
system  is  now  in  production  use. 

Phase  II  of  the  MMACE  program,  scheduled  to  last  three  years,  began  in  March  of  1994.  Raytheon 
Company  and  Compact  Software,  along  with  several  subcontractors^  are  under  contract  to  jointly 
develop  the  complete  framework.  However,  it  became  obvious  that  the  problem  should  be  split  into  two 
parts.  The  first  part  of  the  problem  addresses  those  areas  specific  to  power  tube  design.  The  second 
part  of  the  problem  addresses  areas  that  are  common  to  multiple  design  disciplines  and  therefore  have 
far  greater  applicability.  The  portion  of  the  system  common  to  multiple  disciplines,  including  CEM,  is 
referred  to  as  REF,  the  Research  and  Engineering  Framework.  Version  1.2  of  the  REF  was  shipped  in 
November  of  1994  and  is  now  available  for  both  the  Hewlett-Packard  and  Sun  platforms.  Source  code  is 
available  for  those  wishing  to  port  the  system  to  additional  platforms. 

OVERVIEW 

The  REF  contains  programming  interfaces,  standards,  and  tools  to  aid  in  the  integration  of  codes.  In 
addition,  it  provides  the  data  management,  networking,  visualization,  and  "Master"  geometry  needed  to 
drive  analysis  codes.  The  REF  also  provides  the  utilities  needed  to  construct  graphical  user  interfaces 
including  the  top  level  control  panel.  The  REF  portion  of  the  system  can  be  compared  to  a  computer 
operating  system:  generic,  powerful,  and  providing  much  functionality  but  of  little  interest  to  the  end  user 
who  requires  software  that  is  written  to  operate  on  top  of  it.  Rgure  1  shows  the  structure  and  major 
components  of  the  REF. 
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Figure  1  -  REF  Architecture 


■*  The  Compact/Raytheon  Team  includes  Sabbagh  Associates,  Hughes,  Litton,  ITT,  Teledyne,  and 
Varian  as  subcontractors. 
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On  Top  of  the  REF,  one  can  iayer  design  tools  specific  to  a  given  design  discipline,  such  as  CEm.  For  the 
MM  ACE  program,  we  have  tested  the  REF  by  integrating  a  coiiection  of  design  toois  used  in  the  design 
and  analysis  of  power  tubes  and  the  data  dictionary  describing  the  data  elements  needed  for  that  design 
and  analysis.  These  design  and  analysis  codes  can  be  compared  to  off  the  shelf  software  that  can  be 
loaded  on  a  system  and  accessed  by  the  end  users  to  perform  specific  functions.  However,  like  off  the 
shelf  software,  without  the  underlying  operating  system  (in  this  case  the  REF)  the  software  would  be 
useless.  Figure  2  shows  the  REF  with  discipline  specific  tools,  databases,  and  data  dictionaries 
connected  to  provide  the  user  with  a  complete  environment.  Although  the  specific  codes  that  would  be 
integrated  for  C EM  would  be  different  from  those  that  have  been  Integrated  to  date,  the  concept  Is  the 
same. 
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Figure  2  <  REF  Environment  Customized  for  a  Specific  Design  Discipline 
RESEARCH  AND  ENGINEERING  FRAMEWORK  (REF)  DETAIL 

While  Phase  I  of  the  MM  ACE  program  concentrated  on  developing  design  requirements  and 
implementation  of  a  prototype  framework.  Phase  II  focuses  on  enhancing,  extending,  and  developing 
framework  technology  in  eight  key  areas: 

-  Ease  of  use  and  consistency 

-  Sharing  of  common  design  data 

“  "Master”  geometry  driven  analysis 

-  Standard  foundation  for  numerical  grid  generation 

-  Integration  of  tools  at  different  dimensionalities 

-  Visual  connection  of  tools  and  data  flows 

-  Use,  configuration  &  management  of  design  data 

-  Transparent  support  for  network  communications  &  distributed  processing 

At  the  same  time,  the  MMACE  Team  has  continued  to  provide  support  for  standards  activities,  system 
validation,  tool  integration,  and  system  dissemination. 

The  following  Is  a  summary  of  the  REPs  features  in  each  of  the  areas  listed  .  Each  of  the  areas  is 
addressed  by  a  piece  of  the  architecture  shown  In  figures  1  and  2. 
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Ease  of  Use  and  Consistency 


During  the  initial  implementation  of  the  REF,  a  heavy  emphasis  was  placed  on  making  the  system  usable 
and  extendible  by  design  engineers,  not  software  engineers.  This  work  is  currently  being  enhanced  to 
provide  technology  in  three  key  areas. 

Rrst,  code  wrappering  software  and  a  standard  user  interface  library  is  being  created  so  that  users  can 
easily  place  existing  codes  into  the  framework  and  provide  them  with  a  standardized  appearance  and 
behavior. 

Second,  all  documentation  is  being  placed  on-line  and  made  accessible  over  the  Internet  network.  This 
will  ensure  that  all  users,  regardless  of  their  location,  have  the  latest  and  most  up-to-date  manuals 
available  to  them.  This  will  be  accomplished  by  utilizing  the  NCSA  Mosaic  software,  the  software  of 
choice  for  exploring  the  Internet. 

Third,  a  standard  set  of  visualization  tools  is  being  included  with  the  framework  so  that  code  developers 
need’ not  create  their  own.  The  need  for  proprietary  visualization  tools  will  be  eliminated  and,  with  their 
elimination,  some  of  the  expense  of  developing  new  codes. 

Sharing  of  Common  Design  Data 

A  large  emphasis  has  been  placed  on  sharing  design  data  between  codes.  This  concept  Is  currently 
being  extended  to  allow  additional  types  of  data,  such  as  thermomechanical  or  electromagnetic  data,  to 
be  shared.  This  Is  being  accomplished  by  developing  a  detailed  data  model  and  programmatic  data 
interface.  Users  are  able  to  extend  the  data  mode!  to  access  data  entities  that  are  not  contained  in  the 
data  model  as  delivered.  A  separate  data  model  for  CEM  is  being  create  at  Rome  Laboratory. 


Sharing  of 

Common  Design  Data 


The  MMACE  Framework  will  allow  engineers  working  on 
different  portions  of  the  design  to  share  data _ 


Figure  3  -  Sharing  of  Common  Design  Data 

The  REF  includes  Application  Programming  Interfaces  (APIs)  to  aid  in  the  development  of  design  and 
analysis  codes.  These  APIs  allow  code  developers  to  concentrate  on  the  algorithmic  portions  of  the  code 
and  avoid  the  time  consuming  data  exchange  portions  of  the  coding.  In  addition,  these  APIs  insulate  the 
code  developer  from  changes  to  the  underlying  implementation.  At  present,  the  REF  includes  the 
following  APIs  to  aid  data  sharing: 
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•  Data  API  -  This  programming  interface  provides  direct  read  and  write  access  to  all  of  the  data 
attributes  in  the  REF  database. 

•  Particle  &  Field  API  -  This  programming  Interface  provides  direct  read  and  write  access  to  all  of 
the  particle  and  field  data  in  the  REF  database.  This  API  can  be  used  for  accessing  any  large 
matrix  of  data. 

The  goal  in  this  area  is  to  promote  data  reuse  while  building  on  existing  standards  and  off-the-shelf 
commercial  products.  It  is  not  the  REF  developer’s  intent  to  create  a  new  database  product.  Rather,  the 
REF  will  provide  the  appropriate  layers  so  that  users  can  select  the  pieces  they  desire  without  having  to 
worry  about  the  affect  on  the  analysis  codes. 

Master  Geometry 

One  of  the  goals  of  the  REF  is  to  have  all  codes  utilize  a  single,  consistent  source  to  derive  a  geometry 
model  for  analysis.  This  task  would  be  simplified  if  a  single  CAD  package  were  chosen  and  the 
framework  provided  access  to  that  CAD  package’s  database  to  all  codes.  However,  note  that  the  issue  of 
providing  geometry  to  analysis  and  simulation  codes  is  far  more  complex  than  simply  accessing  a  CAD 
database.  Extensive  work  would  be  required,  regardless  of  the  CAD  package  chosen,  to  enable  a 
heterogeneous  mix  of  codes  to  share  a  geometry  source. 

While  many  large  companies  have  adopted  the  single  CAD  system  approach,  it  has  been  demonstrated 
to  have  several  pitfalls: 

•  The  user  is  tied  to  a  particular  CAD  package  and  cannot  change  without  considerable 
expense. 

•  Enhancements  and  bug  fixes  will  be  made  based  on  the  perceived  market  for  the  CAD 
product.  This  will  usually  not  help  niche  scientific  areas  such  as  those  addressed  by  the 
REF. 

•  Everyone  must  agree  on  the  CAD  package.  This  is  highly  unlikely  due  to  different  business 
requirements,  cost,  corporate  standards,  etc. 

For  the  REF,  the  choice  of  a  CAD  package  is  left  up  to  the  user.  The  REF  takes  the  approach  of  having  a 
buffer  around  the  CAD  engine  so  that  the  geometry  information  can  be  accessed  by  all  codes  regardless 
of  the  specific  CAD  package  used.  To  help  build  this  layer,  the  MMACE  Team  is  using  the  NASA  IGES 
standard  and  the  DTNURBS  library  developed  by  Boeing  Computer  Services  under  contract  from  the 
Navy’s  David  Taylor  Modeling  Basin. 

An  API  has  been  created  to  access  the  geometry  information  regardless  of  the  user's  choice  of  CAD 
package. 

Numerical  Grid  Generation 

A  key  feature  of  the  REF  environment  is  the  support  for  numerical  grid  generation.  In  the  current  design 
environment,  g ridding  software  is  often  tied  to  a  particular  code  and  Is  sometimes  embedded  within  that 
code.  The  objective  of  the  MMACE  program  is  to  develop  standards  for  the  exchange  of  grid  Information 
and,  in  doing  so,  allow  multiple  codes  to  share  grid  information.  In  addition,  by  creating  this  standard 
layer,  users  will  be  able  to  plug-in  the  latest  gridding  software  as  it  becomes  available.  To  test  this,  an 
existing  grid  generator  will  be  adapted  to  use  the  REF  gridding  standard.  This  will  then  be  accessed  by 
both  electromagnetic  and  thermomechanical  codes.  An  example  is  shown  in  figure  4. 

The  REF  grid  generation  requirements  for  both  thermomechanical  (TM)  and  computational 
electromagnetic  (CEM)  analysis  can  be  briefly  summarized  as  follows: 

•  Mixture  of  line,  surface  and  solid  elements  -  node-based  geometric  entities  in  TM,  and  both 
node  and  edge-based  in  CEM. 

•  Steep  gradient  of  grid  density  to  accurately  model  localized  phenomena. 
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P'type  and  H-type  grid  refinements  in  adaptive  gridding 

Specification  of  boundary  codes.,  loadings,  initial  conditions  and  material  properties  (e.g., 
density,  thermal  and  electrical  conductivities,  Young’s  modulus,  Poisson's  ratio,  permeability, 
dielectric  permitivity,  etc.) 

Sliding  Interfaces  and  joints  in  TM,  and  surfaces  of  discontinuities  in  CEM. 

Symmetry  (or  anti-symmetry)  planes  . 

Computed  quantities  for  post  processing  -  for  TM:  temperature,  displacements,  velocities, 
accelerations,  stresses,  bending  moments,  material-dependent  variables,  etc.,  for  CEM. 
electric  and  magnetic  fields,  current  density,  scalar  and  vector  potentials,  etc.  These  may 
involve  complex  numbers.  . 

Rectangular  grids  used  in  the  finite-difference  and  volume-integral  methods  in  CEM. 


These  requirements  are  being  used  as  input  for  the  design  and  development  of  the  REF  gridding 
Interface. 

Integration  of  Codes  at  Different  Dimensionalities 

In  the  existing  MMACE  system  for  power  tube  design,  a  link  was  established  between  the  1 D  parametric 
codes  and  the  2D  geometry  driven  codes.  At  present,  the  link  is  uni-directional  and  does  not 
accommodate  the  users’  3D  requirements. 

An  effort  is  being  made  to  generalize  the  existing  software  to  provide  a  complete  integration  of  codes  at 
different  dimensionalities.  This  is  not  a  trivial  problem.  For  axisymmetric  designs,  going  from  ID 
information  to  2D  information  or  from  2D  to  3D  is  fairly  straightforward.  However,  In  the  other  direction 
(e  g  going  from  a  3D  database  to  a  2D  database),  approximations  must  be  made  which  may  effect  the 
outcome  of  the  analysis.  The  REF  developers  are  taking  a  multifaceted  approach  to  connecting  codes 
and  data  operating  at  different  levels  of  detail.  Basic  methodologies  and  tools  are  being  developed  which 
will  interact  with  the  user  to  select,  abstract,  and  connect/transport  parametric  and  geometric  design  data 
to  codes  at  both  higher  and  lower  dimensionalities. 
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Visual  Connection  of  Tools  and  Data  Flows 


The  REF  includes  a  customizable  control  panel  to  allow  access  to  tools  and  to  visually  display  the  design 
flow.  The  MMACE  Control  Panel,  shown  in  Rgure  5,  is  the  first  thing  the  user  sees  when  accessing  the 
Framework.  While  the  REF  provided  the  underlying  tools  for  creating  the  control  panel,  the  specific 
button  layout  and  dependencies  that  are  shown  are  specific  to  power  tube  design.  Obviously,  the  buttons 
and  layout  would  be  different  for  CEM.  The  control  panel  contains  buttons  for  each  of  the  codes  available 
to  the  user.  These  buttons  are  laid  out  in  the  process  flow  appropriate  to  a  given  project.  Each  design 
could  have  a  different  process.  Notice  that  the  arrows  indicate  the  design  flow  direction  as  well  as 
whether  a  particular  code  is  blocked,  ready,  running,  or  done.  The  user  simply  executes  codes  by 
pushing  buttons  available  on  the  OSF/Motif  compliant  interface.  The  control  pane!  has  been  designed  in 
such  a  way  to  make  it  possible  to  “plug"  in  new  codes  as  needed. 

CEM  design  and  analysis  tools  can  easily  be  integrated  into  the  system  using  the  wrappering  tools 
included  with  the  REF. 

The  control  panel  also  provides  access  to  the  data  visualization  tools  and  to  the  version  control  system. 
The  version  control  system  can  be  configured  to  automatically  archive  key  versions  of  design  data  at 
specified  points  In  the  design  cycle.  This  allows  designers  to  try  several  design  variants  and  then  easily 
locate  and  return  to  the  variant  that  provided  the  best  results.  The  underlying  REF  core  manages  the 
versioning  of  data. 


Figure  5  <  Control  Panel 

Configuration  and  Management  of  Design  Data 

Proper  management  of  design  data  is  critical  to  producing  a  quality  product.  Typically,  design  data  has 
been  maintained  by  the  design  engineer  in  hardcopy  format.  Data  is  often  stored  in  several  filing  cabinets 
and  in  other  places  throughout  the  organization.  It  is  often  difficult  to  find  the  latest  version  of  the  data, 
especially  if  people  involved  in  the  original  design  are  no  longer  around. 

At  present,  a  configuration  and  management  system  is  being  developed  so  that  the  REF  can 
automatically  track  design  data.  The  system  will  keep  electronic  copies  of  different  versions  of  the  data, 
both  for  designs  under  development  and  for  products  that  have  shipped.  Users  will  be  able  to  instantly 
retrieve  data  from  any  version  of  the  product.  In  addition,  by  automatically  keeping  track  of  the  data, 
users  will  be  able  to  explore  different  design  variations  without  having  to  worry  about  overwriting 
information  from  alternate  design  options. 
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Once  a  design  is  complete,  data  for  alternate  design  options  can  be  deleted  to  conserve  space  and  the 
final  database  can  be  archived  in  an  electronic  vault  for  future  reference.  Often,  this  archived  version  will 
be  used  as  the  starting  point  for  upgrades  or  even  for  new  designs  that  are  similar  to  the  one  archived. 
An  index  will  make  it  easy  to  locate  archived  designs. 

The  REF  design  data  management  system  will  be  an  adjunct  to,  not  a  replacement  for,  more  capable  and 
full-featured  corporate  data  management  systems.  The  focus  of  REF  data  administration  capabilities  will 
be  the  support  of  small  R&D  groups  in  the  configuration  and  management  of  their  technical  data. 

Inherent  Networking 

Users  of  the  REF  typically  have  environments  comprised  of  multiple  workstations  and  mainframe 
computers.  Users  want  to  make  use  of  these  resources  by  performing  remote  executions  of  codes  and,  if 
supported  by  the  codes  themselves,  to  perform  distributed  processing.  For  example,  a  user  in  a 
company’s  engineering  facility  might  wish  to  utilize  the  mainframe  resources  available  at  another 
company  facility  over  a  wide  area  network.  The  REF  indudes  a  complete  programmatic  subroutine  library 
allowing  users  to  take  advantage  of  the  high-performance  hardware  distributed  throughout  their 
company’s  network.  In  extreme  cases,  users  could  even  use  this  technology  to  submit  large  processing 
jobs  to  other  companies  across  a  wide  area  network  such  as  the  Internet.  Long  running  CEM  codes 
could  be  set  up  on  a  local  workstation  and  executed  on  a  remote,  massively  parallel  processor.  Data 
post-processing  could  still  be  accomplished  locally.  Unfortunately,  none  of  the  power  tube  design  and 
analysis  codes  that  have  been  integrated  into  the  framework  to  date  are  using  this  networking  capability. 
This  is  probably  due  to  the  fact  that  the  codes  integrated  to  date  have  been  pre-existing  codes  making 
incorporation  of  distributed  and/or  remote  processing  difficult.  It  is  expected  that  this  capability  will  be  of 
greater  use  for  new  design  and  analysis  codes  currently  under  development, 

SUMMARY 

Phase  II  of  the  MMACE  program  will  provide  a  full  research  and  engineering  framework  for  niche 
technical  communities  which  will  approach  the  functionality,  quality,  and  robustness  found  in  commercial 
products.  The  REF  core  development  will  focus  on  some  of  the  problems  not  addressed  adequately  by 
commercial  vendors  because  of  the  typically  small  market  size  of  engineering  and  scientific  communities. 
Specifically,  MMACE  and  its  underlying  REF  core  will  provide  a  low-cost  environment  that  is  easy  to  use, 
has  a  consistent  interface,  promotes  data  sharing,  provides  a  unified  source  of  geometry,  supplies 
standards  for  numerical  grid  generation,  integrates  1 D,  2D,  and  3D  codes,  provides  a  visual  means  for  the 
user  to  view  data  and  data  flows,  performs  configuration  management  of  the  design  data,  and  supports 
the  use  of  heterogeneous  networks  of  computers. 

The  REF  will  provide  the  foundation  for  design  environments  that  are  applicable  to  a  wide  variety  of 
technical  design  disciplines  including,  power  tube  design,  antenna  design,  and  computational 
electromagnetics  among  others.  The  REF  software  is  distributed  at  no  charge^  and  the  developers  are 
eager  to  work  with  anyone  who  desires  to  make  use  of  the  software. 
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Research  &  Engineering  Framework  (REF)  Data  Dictionary 
Specification  for  Computational  Electromagnetics 

Mr  Jeffrey  A.  Evans 
Decision  Science  Applications,  Inc. 

The  Research  and  Engineering  Framework  (REF)  manages  data  for  a  variety  of  design  and 
analysis  codes.  Each  of  these  different  design  and  analysis  codes  have  their  own  data 
requirements  (data  model)  for  input  and  output.  The  method  that  the  REF  uses  to  handle  these 
different  data  models  is  through  the  use  of  a  data  dictionary.  REF  users  will  have  the  ability  to 
switch  between  data  models  thereby  transforming  the  generic  REF  into  a  discipline-specific  design 


This  paper  discusses  the 
design  and  development  of  a 
Computational  Electromagnetic 
(CEM)  specific  data  dictionary 
for  REF.  Types  of  data  covered 
by  this  dictionary  include:  CEM 
Analysis  Techniques;  Geometry 
Elements;  Material  Properties; 
Excitation  Information;  and 
Obsevables  Information.  The 
CEM  data  dictionary  is  being 
developed  as  a  baseline  data 
dictionary  for  the  wide  variety  of 
CEM  codes  available.  A 
developer  need  only  tie  his/her 
application-specific  code  into  the 
REF  directly  or  through 
wrappering  to  access  this  data 
dictionary. 

Common  areas  of  CEM,  Computation  Fluid  Dynamics,  and  Finite  Element  Analysis  for 
Structural  and  Thermal  investigations  were  considered  to  facilitate  transfer  of  information  and  the 
future  development  of  discipline-specific  REF  data  dictionaries.  These  areas  are  of  immediate 
interest  due  to  their  interaction  with  electromagnetics. 

The  REF  data  dictionary  and  its  support  software  then  provide  a  means  for  validating 
application  data  input  and  facilitate  extending  the  data  model  and  dictionary  to  a  variety  of 
engineering  disciplines. 


This  work  was  Sponsored  by  Rome  Laboratory  under  the 
Expert  Science  And  Engineering  Program  (BAA  93-07) 


126 
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Abstract 

This  paper  describes  the  role  of  the  DT_NURBS  Spline  Geometry  Library  in  the  Microwave/Millimeter  Advanced 
Computational  Environment  (MMACE)  system  architecture.  DT_NURBS  provides  a  common  geometry,  gridding,  and 
analysis  data  structure  for  the  MMACE  Research  and  Engineermg  Framework  (REF).  The  DT_NURBS  library  also  offers 
the  CEM  community  the  means  to  integrate  geometry,  gridding,  and  analysis  data  into  a  common  product  model.  This 
product  model  allows  for  multi-discipline  integration  in  a  logical  and  inter-dependent  way. 

Background 

It  has  become  evident  over  the  years  that  geometry  plays  a  vital  role  as  the  common  element  in  the  design  and  analysis  of 
complex  systems.  In  essence,  computational  geometry  is  the  closest  data  representation  of  the  actual  product  of  any  piece 
of  information.  It  is  not  surprising  then  that  when  complex  systems  are  designed,  the  integration  of  performance  data  with 
geometry  becomes  critical.  This  is  most  evident  when  two  dependent  disciplines  interact  with  a  changing  geometry. 
Consider  an  example  where  thermal  characteristics  effect  the  geometry  which  in  turn  vary  electromagnetic  performance. 
Airplane  wing  design  also  requires  a  coupling  of  aerodynamic  load  and  structural  response.  Each  in  turn  affects  the  other 
and  the  resultant  “in-flight”  shape  is  found  through  iteration  to  equilibrium.  As  efforts  at  multi-discipline  integration 
increase  so  will  the  requirements  for  a  robust  and  complete  common  geometry  model. 

In  the  early  80's  some  pioneers  in  integration  schemes  and  math  modeling  techniques  developed  a  mathematical 
characterization  called  Non-Uniform  Rational  B-Splines,  NURBS.  This  NURBS  math  model  is  very  capable  of  modeling 
complex  geometry  structures  as  well  as  correctly  representing  circles,  ellipses,  etc.,  which  could  only  be  approximated  by 
other  splines.  This  allows  for  all  geometry  entities  to  be  based  on  one  math  form  whether  they  are  lines,  arcs,  or  trimmed 
contoured  surfaces  joined  to  formed  a  bounded  solid.  The  NURBS  math  form  is  also  capable  of  representing  other 
geometry  that  may  have  been  created  from  a  different  math  model  (i.e.  Bezier,  uniform  non -rational  B-splines,  Coons 
patches,  CSG  primitives  like  spheres,  cones,  etc.). 

In  1989  a  geometry  library  called  DT_NURBS  was  developed.  The  objective  of  the  library  is  to  provide  a  common 
geometry  math  model  for  the  integration  of  geometrically  dependent  analysis  tools  with  Computer-Aided-Design  systems; 
and  to  provide  tools  for  unique  applications  unsupported  by  commercial  programs.  In  particular,  DT_NURBS  was 
developed  to  provide  the  tools  for  integration  of  computational,  physics  based  codes  and  other  geometrically  dependent 
analysis.  These  tools  include  grid  and  analysis  modeling  as  well  a  more  general  CAD  related  geometry  modeling.  In 
addition,  DT_NURBS  was  designed  to  accommodate  more  difficult  modeling  requirements.  These  include,  but  are  not 
limited  to,  designs  where  geometry,  grids,  and  analysis  varying  with  time,  iteration,  or  configuration.  This  is  accomplished 
through  the  use  of  multi-dimensional  NURBS  based  mathematics. 
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Implementation 

The  implementation  of  DT__NURBS  in  the  engineering  world  could  take  almost  unlimited  form.  In  general,  the 
objective  of  DT_NURBS  is  to  provide  analysis  programs,  or  their  pre-  and  post-processors,  with  a  library  of  routines 
which  allow  these  codes  to  1)  read  CAD  generated  data;  2)  develop,  modify,  and  access  grids;  and  3)  store  the  resultant 
geometry,  grid,  and  analysis  data  in  a  single  topology.  A  DT_NURBS  based  application  would  therefore  contain  and 
control  all  information  relating  to  geometry,  grids,  and  analysis.  Since  analysis  is  dependent  on  grids,  and  grids  on 
geometry,  the  coupling  of  these  elements  in  an  object  oriented  data  structure  is  logical.  In  some  situations,  solvers  need 
access  to  geometry  and  control  of  grids.  Transient  or  unsteady  analysis  of  geometry  varying  with  time  requires  adaptive 
gridding  and  multi-discipline  coupling  in  a  “hands  off’  non-interactive  framework.  The  same  is  true  for  design 
optimization. 

Figure  1  illustrates  this  type  of  configuration  where  two  or  more  applications  directly  communicate  the  same  data 
structure;  thereby  eliminating  the  need  for  data  translation,  or  reformatting.  In  this  configuration,  the  common  geometry 
used  by  both  codes  may  have  been  initially  constructed  on  a  CAD  system.  Once  constructed,  the  geometry  is  saved  in 
IGES  format.  This  format  will  contain  entities  the  original  developer  used  in  constructing  the  geometry  and  its  IGES 
representation  of  those  entities  will  be  set  by  the  CAD  vendor.  Through  an  IGES  filter  program  developed  by  NASA 
called  NIGES,  the  non-NURBS  IGES  entities  are  filtered  or  translated  into  Non-Uniform  Rational  B-Splines,  NURBS.  No 
loss  of  accuracy  occurs  in  this  translation  because  NURBS  are  a  mathematical  superset  of  most  CAD  entities.  Most  CAD 
systems  store  native  data  in  NURBS  format  internally  as  well.  Once  a  NURBS  based  IGES  file  is  generated  the  geometry 
can  be  read  by  both  applications.  Theoretically,  DT_NURBS  can  read  an  unfiltered  IGES  file.  However,  not  all  entities 
can  be  automatically  used  in  calls  to  DT_NURBS  routines.  DT_NURBS  actually  supports  an  extension  of  the  NASA 
standard.  In  the  figure  it  is  referred  to  as  DT/NIGES.  These  DT  extensions  to  the  NASA-IGES  standard  are  stored  as  user 
defined  entities  and  are  ignored  by  most  CAD  systems.  In  this  area  of  the  IGES  file,  DT_NURBS  stores  geometry,  grids, 
and  analysis  data  in  a  mapped  object  topology,  called  Geometiy/Grid/Analysis  Objects  or  GGA.  Because  IGES  is  large 
and  not  efficient  for  use  as  a  common  flat  file  format,  the  DTRC  native  format  will  generally  be  used  when  sharing  data 
between  applications.  A  DT/NIGES  file  or  standard  IGES  file  can  be  written  at  any  time  for  communication  with  other 
CAD  related  applications.  The  main  use  of  the  DT/NIGES  file  format  is  for  exchange  of  a  single  data  file  where  some 
applications  may  be  ‘DT_NURBS  aware’  while  other  applications  may  only  be  able  to  use  the  common  geometiy  stored  as 
standard  IGES  entities.  The  association  between  common  geometry  (standard  IGES)  and  GGA  objects  (DT/NIGES)  is 
maintained  by  being  in  one  IGES  file.  Future  DT  NURBS  support  for  the  STEP  standard  will  improve  translation  of  B-rep 
solids  and  other  product  model  type  data.  GGA  type  data  will  still  require  a  separate  Application  Protocol  since  none  of 
the  existing  AP’s  currently  support  GGA  functionality. 


FIGURE  1  -  DT_NURBS  BASED  INTEGRATION 

In  many  situations,  however,  it  may  not  be  prudent  to  modify  analysis  codes  to  accommodate  the  DT_NURBS  data 
structure  and  GGA  topology.  At  least  not  initially.  Cost,  time,  size,  maintenance,  or  other  software  development  issues 
may  preclude  the  inclusion  of  DT_NURBS  data  structures  internal  to  a  code.  This  forces  the  GGA  object  topology  to 
reside  outside  the  applications  themselves.  In  these  situations  DT_NURBS  can  act  as  an  toolkit  or  foundation  for  an 
Application  Program  Interface,  or  API,  between  applications  and  the  data  base  (see  Figure  2).  The  best  way  to  visualize 
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this  architecture  is  to  think  of  DT_NURBS  as  a  toolbox  that  allows  for  the  construction,  evaluation,  manipulation,  and 
storage  of  geometry,  grids,  and  analysis.  In  this  toolbox,  DT_NURBS  ensures  the  dependency  between  geometry,  grids, 
analysis,  and  attribute  data,  but  requires  the  maintenance  to  occur  within  application  wrappers  by  way  of  general  access 
API’s.  It  is  not  the  intent  to  describe  completely  the  different  architectures  available  to  DT_NURBS,  but  rather  to 
illustrate  its  capabilities  in  such  an  environment.  The  MMACE  architecture  is  very  similar  to  this  latter  configuration. 

The  MMACE  data  base  description  is  outlined  in  more  detail  in  Reference  1. 


System  Applications 


User  Applications 


Input  Output 
Files  ^  Files 


REF  WRAPPER 


System  API 


Grid  Geometry  Analysis  Properties  Control 
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REF  Database  API  Layer 
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Flat  Files  | 

REF  Database  Architecture 


FIGURE  2  -  REF  DATABASE  ARCHITECTURE 


The  DT_NURBS  library  provides  the  MMACE  with  essential  components  to  the  Geometry,  Gridding,  and  Analysis 
Application  Program  Interface  (API).  DT^NURBS  provides  a  suite  of  subroutines  which  give  the  programmer  a  wide 
range  of  capabilities;  from  simple  curve  fitting  to  modeling,  manipulating,  or  evaluating  solid  objects  containing  geometry, 
grid,  and  analysis  data.  One  unique  capability  of  DT_NURBS  is  the  ability  to  literally  place  grid  and  analysis  data  inside 
geometry  through  the  use  of  GGA  objects. 
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Geometry/Grid/Analysis  Objects 

DT_NURBS  models  common  geometry,  grids,  and  analysis  in  an  object  oriented  topology  where  the  dependence  between 
geometry,  grids,  and  analysis  solutions  are  maintained.  This  architecture  is  called  Geometry/  Grid/  Analysis 
Objects,  or  GGA.  [Ref.  2] 

To  understand  how  GGA  objects  function  it  is  necessary  to  outline  some  of  the  unique  features  of  DT_NURBS  that  allow 
GGA  objects  to  operate  as  they  do.  Most  DT_NURBS  entities  have  unlimited  dimensionality.  For  example,  a  point  is 
usually  considered  to  be  3  dimensional,  but  a  vector  has  variable  length.  A  vector  can  model  all  points,  but  points  caimot 
define  all  vectors.  Using  the  word  ‘point’  has  some  meaning  beyond  the  raw  data;  but  in  reality  it  is  just  a  vector  of  data. 
We  typically  associate  point  with  a  location  in  Cartesain  space.  From  a  data  modeling  perspective  it  is  just  a  vector  of 
length  three  containing  real  data.  This  is  the  easiest  way  to  think  of  the  differences  between  DT_NURBS  modeling  and 
CAD  modeling.  DT_NURBS  operates  on  data;  some  of  which  happen  to  be  used  for  the  construction  of  geometry.  In 
CAD  systems,  all  data  assumes  it  is  associated  with  or  defining  geometry.  DT_NURBS  would  model  points  as  vectors.  If 
the  user  only  wants  to  store  three  Cartesian  data  values  that  is  perfectly  acceptable.  Should  the  user  decide  to  store 
temperature  and  pressure  along  with  the  Cartesian  data  DT_NURBS  will  allow  it,  CAD  systems  won’t.  If  the  user  should 
choose  to  fit  a  surface  or  solid  through  those  vector  points  DT_NURBS  can  do  that.  If  the  user  then  chooses  to  view  the 
model  as  a  3D  image,  the  three  of  the  five  parameters  would  be  chosen.  If  it  happens  to  be  (x,y,z)  then  only  geometry  is 
plotted. 


DT_NURBS  has  the  ability  to  specify  the  dimensionality  of  its  entities  to  any  dimension.  It  therefore  has  the  ability  to  act 
as  a  superset  of  traditional  CAD  entities.  Secondly,  spline  entities  like  curves  and  surfaces  have  independent  parameters 
that  the  spline  entity  is  normalized  to.  For  a  curve  it  could  be  curve  length.  Traditional  spline  curve  entities  have  one 
independent  variable,  surfaces  have  tw'O,  and  solids  have  three.  DT_NURBS  has  no  limit  on  the  number  of  independent 
variables  for  the  parameterization  of  these  entities.  Consider  a  traditional  surface.  Normally  it  has  three  dependent 
variables,  namely,  (x,y,z).  When  a  surface  is  constructed  it  is  traditionally  parameterized  by  two  independent  variables 
(u,v),  see  Figure  3. 


FIGURE  3  -  TRADITIONAL  SURFACE  PARAMETERS 

DT_NURBS  allows  an  unlimited  number  of  dimensions  for  both  dependent  and  independent  variables.  What  this  means  is 
the  we  can  add  parameters  such  as  iteration,  time  step,  or  configuration  to  the  traditional  independent  parameters  used  by 
CAD  systems.  In  this  situation,  the  dependent  variables,  (x,y,z),  can  be  evaluated  given  a  set  of  independent  variables,  or 
parametric  location  (u,v,config, time, iter).  Written  backwards,  eveiy'  point  on  the  geometry  is  defined  by  it  location  along 
(u,v)  at  a  particular  iteration,  during  a  single  time  step,  of  one  of  the  design  configurations.  This  is  a  simple  case  of 
variational  geometry  modeled  as  a  single  geometric  entity.  Consider  again  the  case  where  geometry  is  changing  with 
iteration,  time,  and  configuration.  Figure  4  depicts  a  hierarchy  of  data  structure  showing  iterations  per  time  step  and  time 
step  simulations  per  configuration.  Perhaps  we  have  an  optimization  program  controlling  design  configuration.  Each 
design  geometry  goes  through  a  simulation  where  it  may  be  changing  with  time,  perhaps  because  of  environmental  or 
operational  conditions.  Between  each  time  step  the  geometry  iterates  to  convergence  between  two  analysis  applications 
such  as  mechanical  and  thermal. 
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FIGURE  4  -  GEOMETRY  CHANGING  WITH  ITERATION,  TIME,  AND  CONFIGURATION 

This  is  just  one  example  of  the  possible  topologies  available  through  DT_NURBS.  For  example,  if  we  add  the  parameter 
(w)  we  can  model  a  solid  changing  with  these  parameters  (u,v,w,config,time,iter)  as  well.  The  DT_NURBS  representation 
more  closely  models  geometric  reality.  What  is  the  geometry  of  a  balloon?  When  its  inflated,  deflated,  deflating,  or 
blowing  in  the  wind?  In  a  CAD  system  you  will  get  to  model  only  one  representation. 

GGA  objects  extend  the  ability  to  model  variational  geometry  to  include  to  storage  of  grid  and  analysis  data  along  with  the 
geometry.  The  GGA  object  is  based  on  an  hierarchical  object  architecture  where  objects  have  the  same  mathematical 
representation  and  dimensionality  but  have  dependencies  on  each  other  through  mappings.  Grids  are  imbedded  in 
surfaces  or  solids,  and  analysis  is  modeled  as  surface  or  solid  interpolations  of  solution  data  stored  at  grid  points. 
Geometry,  grid,  and  analysis  data  may  change  content  with  time,  iteration,  or  any  number  of  additional  parameters  without 
losing  their  dependence  on  each  other. 

Consider  the  previous  example  of  the  geometry  varying  with  iteration,  time,  and  configuration.  Suppose  now  that  we 
attach  a  grid  that  is  ‘stuck’  or  mapped  to  this  geometry.  This  grid  contains  vector  nodes  that  can  hold  geometric  and  non¬ 
geometric  data  at  each  node.  The  non-geometric  data  could  contain  analysis  data,  property  data,  or  any  data  that  varies 
over  the  domain  of  the  geometry.  Suppose  again  that  we  fit  a  surface  or  solid  through  these  vector  nodes.  This  new  entity 
will  now  interpolate  and  mathematically  define  the  data  over  the  entire  grid  domain.  Again  this  analysis  entity  is  ‘stuck’  or 
mapped,  like  the  grid,  to  the  parent  geometry.  As  the  geometry  changes  so  does  the  grid  and  analysis  data.  For  simple 
grids  that  contain  vector  nodes  of  just  parametric  geometric  data,  no  additional  information  is  required  as  the  geometry 
changes;  unless  of  course  we  choose  to  change  or  adapt  the  grid  for  each  iteration,  time  step,  and  configuration.  In  this 
scenario,  however,  we  are  also  storing  non-geometric  or  analysis  data  at  each  node.  Therefore,  the  grid  will  have  the  same 
independent  dimensionality  as  the  geometry.  Its  information  will  change  with  iteration,  time,  and  configuration.  This 
same  analogy  holds  true  for  the  analysis  entity  except  that  as  a  spline  function  there  are  some  differences  in 
dimensionality. 

Keep  in  mind  that  in  this  example  there  are  only  three  individual  entities  in  this  entire  simulation.  This  allows  for  coupling 
of  multi-discipline  analysis  for  steady  or  unsteady  analysis.  Each  multi-discipline  application  that  implements  a  GGA  dam 
structure  can  define,  extract,  or  interrogate  the  analytic  geometry  for  solution  data  derived  from  another  application.  This 
point  needs  re-emphasizing.  A  grid  point  for  an  application,  say  finite  element  analysis,  requires  a  pressure  and 
temperature.  By  pointing  to  a  parametric  location  on  the  parent  geometry,  (u,v,w,config,time,iter),  it  can  request  this 
information  from  an  analysis  entity(s)  that  contains  this  information.  This  other  entity  is  also  mapped  to  the  same 
geometry  and  shares  the  same  geometry  parameterization.  When  two  or  more  different,  but  dependent  applications  are 
structured  this  way,  communication  between  them  is  immediate  and  automatic.  These  are  fundamental  requirements  for 
optimization,  design  synthesis,  and  unsteady  or  transient  simulations.  Additional  benefits  of  the  dependent  nature  of  GGA 
objects  is  configuration  control.  The  grid  and  analysis  data  is  mapped  to  the  parent  geometry.  There  is  no  question  about 
which  geometry  the  analysis  was  performed  on  and  what  grids  were  used.  Also,  a  different  kind  of  product  model  is  now 
available;  one  that  contains  solution  data,  performance  information,  and  property  data  that  varies  over  the  geometry. 
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An  Example  Problem 

The  following  problem  will  illustrate  how  one  discipline,  in  this  case  thermal  analysis,  could  go  about  modeling  using 
GGA  objects  for  use  by  two  other  disciplines,  mechanical  analysis  and  electromagnetic  (EM)  analysis.  Our  problem 
involves  the  transient  conditions  existing  within  a  helix  traveling  wave  tube  (TWT)  during  warm  up.  A  simplified  helix 
TWT,  shown  in  Figure  5,  consists  of  a  gun,  slow-wave  structure  (helix),  collector,  and  RF  circuit.  The  gun  assembly, 
consisting  of  the  heater  bucket  (heater),  cathode,  focus  electrode,  anode,  and  related  support  structure,  is  used  for  our 
example.  As  the  gun  components  heat  up,  the  materials  expand,  causing  deformations  of  the  various  structures.  The  gun 
component  geometry  must  be  maintained  in  a  constant  relative  position  during  warm  up  and  operation. 


RF  input  Slow-wave  structure  RF  output 


FIGURE  5  -  SIMPLIFIED  DRAWING  OF  A  HELIX  TWT  [REF.  3] 

The  TWT  component  warm  up  is  a  transient  event  requiring  both  the  thermal  analysis  code  and  mechanical  analysis  code 
to  iterate  in  an  automated  fashion  at  each  time  step  so  that  component  deformations  due  to  thermally  induced  stresses  can 
be  determined.  The  EM  analysis  then  must  compute  the  resulting  electric  field  for  each  deformed  geometry  at  each  time 
step.  The  analysis  domain  used  for  this  example  is  shown  in  Figure  7.  A  possible  GGA  object  design  process 
implementation  is  described  to  show  how  the  data  exchange  problem  between  analysis  codes  is  made  possible. 


FIGURE  6  -  ANALYSIS  DOMAIN  OF  A  TWT  GUN 

To  accomplish  the  TM  and  EM  analyses,  we  need  a  method  of  integrating  the  gun  geometry  as  a  function  of  time,  the  grids 
used  in  each  of  the  three  analyses,  and  a  method  of  storing  the  results  of  the  three  analyses.  Given  these,  thermal  analysts 
will  calculate  the  temperature  distribution  within  the  cathode,  focus  electrode,  anode,  heater,  and  support  structure 
assemblies  at  each  time  interval  and  will  store  temperature  data  at  every  time  step.  The  mechanical  analysts  will  then 
compute  the  structural  deformations  due  to  the  temperature  distribution  effects  on  the  temperature-dependent  mechanical 
properties  of  the  component  materials.  The  EM  analysts,  in  turn,  use  the  deformed  component  geometry  to  model  the 
region  bounded  by  the  gun  components  to  update  the  electric  field  and  the  resulting  effect  on  the  electron  beam.  Note  that 
the  EM  analysts  do  not  directly  need  any  computational  results  from  the  TM  computations,  but  only  the  resulting  deformed 
geometry  computed  by  the  mechanical  solver. 

The  data  for  the  modeling  requirements  can  be  defined  as:  (1)  a  geometry  (or  geometries);  (2)  a  grid  for  each  of  the 
required  analysis,  thermal,  mechanical,  and  EM,  that  is  appropriate  to  each  solver  being  used;  and  finally,  (3)  the  results  of 


132 


analysis.  In  a  traditional  mode  of  operation  the  data  is  treated  separately  and  it  is  up  to  the  designer/analyst  to  manage  the 
interconnections.  We  want  to  facilitate  management  of  the  disparate  data.  Specifically,  we  want  the  integration  to 
accomplish  three  things.  First,  we  want  all  the  relevant  data  to  be  stored  in  a  compact  and  efficient  form.  Second,  we  want 
the  time  dependency  integrated  seamlessly.  This  means  that,  for  example,  at  any  point  on  the  gun  geometry,  and  at  any 
time  during  warm-up,  the  mechanical  solver  should  be  able  to  easily  determine  the  temperature,  density,  or  any  other 
computed  non-geometric  or  performance  variable.  Lastly,  performance  data  should  be  stored  locally,  associated  only  with 
the  corresponding  component  or  region,  and  not  stored  globally. 

It  is  important  to  note  one  of  the  reasons  why  traditional  approaches  to  geometry,  grids,  and  analysis  fail  to  provide  the 
seamless  integration  required.  For  example,  traditional  geometry  is  restricted  to  planar  and  space  curves  described  by  a 
single  independent  parameter,  and  to  surfaces  described  by  two  parameters.  However,  the  problems  we  are  discussing 
require  at  least  three  or  four  independent  parameters.  Solids  require  three  parameters  for  geometry  and  one  for  time.  Hiis 
means  that  standard  representations  such  as  those  provided  by  IGES  or  PDES/STEP  are  not  sufficient  as  a  means  of  storing 
or  sharing  this  type  of  data.  Next,  the  data  we  are  attempting  to  integrate  is  inherently  multi-dimensional  as  it  pertains  to 
dependent  variables.  Not  only  do  we  want  to  deal  with  the  traditional  three  coordinates  of  geometry  but  we  also  want  to 
add  analysis  data  such  as  temperature  or  potential. 


Integration  needs  a  method  for  treating  many  independent  and  many  dependent  variables  as  single  entities  and  a  method 
for  treating  subsets  of  geometry  without  having  to  decompose  the  overall  geometry  into  pieces.  GGA  objects  can  be  solid, 
surface,  or  curve  entities.  These  entities  can  be  imbedded  within  one  another  in  a  parent/child  relationship.  The  child 
objects  are  imbedded  and  move  with  the  parent  entity.  For  example,  the  human  arm  is  a  solid  object  containing  veins, 
muscles,  bones,  and  nerves.  As  the  arm  moves  with  time,  the  shape  and  orientation  of  the  arm  changes,  but  the  body  parts 
within  move  along  with  it.  In  the  case  of  the  TWT  gun,  the  parent  geometry  can  be  modeled  as  a  solid  defining  the  entire 
gun  domain  bemg  analyzed.  Within  that  parent,  the  child  geometries,  or  components  -  modeled  as  either  solids,  surfaces, 
curves,  and  grids  -  can  change  position  and  shape.  The  geometry  decomposition  for  the  TWT  gun  is  shown  in  Figure  7. 
The  choice  of  entity  type  is  a  function  of  the  topology  of  the  region  to  be  studied.  For  example,  in  modeling  the  arm,  a 
choice  of  solid  bones,  tube-surface  veins,  and  curves  for  nerves  might  be  used.  The  same  analogy  is  true  for  the  modeling 
of  TWT  guns.  In  this  example  we  might  model  the  parent  gun  geometry  as  a  solid.  The  child  entities  -  cathode,  anode, 
heater,  and  focus  electrode  -  would  also  be  solids.  The  support  structures  would  be  surfaces  whose  common  boundaries 
with  their  respective  attached  components  are  curves.  In  addition,  each  entity,  solid,  surface,  curve,  or  grid,  can  include 
non-geometric  data  that  can  vary  with  that  entity.  The  electrode,  heater,  cathode,  support  structures,  and  anode  require  TM 
type  data  such  as  temperature,  stress,  strain,  and  density.  The  bounded  EM  region  may  include  electric  field  potential  or 
other  variables. 


FIGURE  7  -  PARENT  AND  CHILD  TOPOLOGY  FOR  TWT  GUN 

For  the  gun,  we  want  to  perform  three  type  of  analysis.  The  type  of  grid  used  for  each  analysis  is  subject  to  the 
requirements  of  each  solver.  In  the  thermal  analysis  case  we  may  choose  to  use  a  structured  grid  and  in  the  mechanical 
analysis  case  we  may  want  a  unstructured  grid.  For  EM  analysis  we  could  use  a  topological  rectangular  grid.  All  different 
grid  types  are  supported  in  the  GGA  architecture.  Figure  8  illustrates  how  different  grids  are  mapped  to  their  respective 
child  components.  As  evident  by  the  figure,  not  all  components  need  to  be  physical  geometries.  In  this  figure  the  bounded 
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EM  region  is  also  defined  as  a  child  entity.  It  shares  common  boundaries  with  other  child  components  thus  ensuring  a 
robust  topology.  The  common  edges,  or  surfaces,  provide  the  means  to  transfer  or  communicate  analysis  data  and  also 
form  boundaries  for  gridding.  Additionally,  these  common  boundaries  can  be  used  to  establish  common  grid  points  for 
both  components  if  this  is  required. 


FIGURE  8  -  GRIDS  AND  COMPONENT  ENTITIES  FOR  TWT  GUN 

The  solution  to  the  TWT  gun  analysis  problem  then  consists  of  first  creating  a  single  geometry  of  the  TWT  gun,  possibly 
extracted  from  a  common  IGES  file  of  the  geometry.  Each  discipline  then  creates  the  grid  desired.  GGA  objects  are 
formed  by  decomposing  the  geometry  into  parent/child  entities.  The  child’s  entities  include  geometry  and  grids.  The 
grids  are  defined  with  appropriate  vector  lengths  necessary  for  the  resultant  analysis  data.  The  three  analysis  codes  then  are 
executed.  The  thermal  solver  performs  the  temperature  solution  at  the  first  time  step,  handing  control  over  to  the 
mechanical  solver,  which  can  automatically  extracts  the  temperature  values  needed  to  drive  deformations  from  the  thermal 
differences.  The  deformations  to  the  mechanical  grid  are  used  to  reshape  the  component  geometry.  The  EM  solver  then 
takes  the  new  deformed  geometry  as  the  new  starting  point  and  computes  the  electric  field  and  the  resulting  electron  beam 
path.  These  data  exchanges  and  inter-disciplinary  iterations  continue  until  the  TWT  reaches  steady  state  conditions.  The 
path  of  the  electron  beam  will  then  be  determined  for  satisfactory  positioning  throughout  the  warm  up  transient.  This 
process  allows  codes  to  proceed  through  the  entire  simulation  without  need  for  mtervention.  Each  analysis  still  uses  the 
grids  necessary  to  achieve  optimum  results,  and  the  solution  data  is  now  stored  with  the  geometry  and  grids. 

Summary 

The  use  of  the  GGA  objects  allows  the  integration  of  analysis  codes  in  such  a  way  that  the  performance  data  required  to  be 
transmitted  through  grids  and  geometry  can  be  done  without  the  need  to  force  grid  compatability  among  the  codes.  The 
implementation  of  GGA  object  based  architectures  enables  multi-disciplinary  analysis  to  be  seamlessly  and  generically 
performed  across  any  set  of  computational  disciplines.  The  DT_NURBS  library  provides  the  MMACE  REF  with  essential 
components  to  the  Geometry /Gridding  Application  Program  Interface  (API).  Similarly,  DT_NURBS  routine  can  be 
embedded  in  any  physics  based  code  such  as  thermal-mechanical  and  CEM.  This  offers  a  highly  coupled  relationship 
between  individual  codes  and  MMACE.  Optimization  techniques  are  also  possible  in  this  environment.  Additional 
benefits  of  the  dependent  nature  of  GGA  objects  is  configuration  control,  and  a  product  model  that  contains  solution  data 
and  other  performance  information. 
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Introduction.  In  the  analysis  of  microwave  devices,  the  complex  geometry  inherent  to  these  systems 
mandates  its  discretization  into  smaller,  and  more  tractable  pieces.  A  mesh  of  grid  points  (or  nodes) 
and  the  connectivity  of  these  nodes  to  form  elements  must  be  generated  to  describe  the  geometry. 
The  nodes  and  elements  possess  certain  attributes:  physical  and  material  properties,  load,  constraints, 
etc.  The  attributes  may  be  scalars,  vectors,  tensors,  or  fields.  In  addition,  there  may  be  attributes 
associated  with  the  discretized  model,  and  certain  information  which  controls  the  analysis.  After  an 
analysis  is  completed,  there  will  be  computed  quantities  associated  with  the  grid  points,  the  elements 
and  the  model.  Examples  of  discrete  analytical  methodologies  include  finite  element,  finite  difference, 
boundary  and  volume  elements.  Current  gridding  practice  of  most  analysis  codes  can  be  summarised  as: 

•  Gridding  algorithms  are  often  embedded  into  individual  codes, 

•  Grids  are  tightly  coupled  to  specific  codes. 

Thus  the  mesh  generated  for  one  code  cannot  be  readily  utilized  by  another  code. 

The  analysis  codes  thus  far  integrated  into  the  Research  and  Engineering  Framework  (REF)  of  the 
Microwave  and  Millimeter  Wave  Advanced  Computational  Environment  (MMACE)  can  be  divided  into 
2  broad  categories:  thermomechanical  (TM)  and  computational  electromagnetic  (CEM).  The  numerical 
grid  and  solution  data  from  these  codes  can  be  stored  in  the  REF  common  database,  either  as  a  complete 
archive  of  the  design  process,  or  to  facilitate  the  reuse  of  the  same  grid  data  between  different  analysis 
codes.  To  achieve  these  goals,  we  are  developing  a  top-level  grid  Application  Programming  Interface 
(API)  that  wiU  serve  as  an  interface  between  the  analysis  tools  that  are  being  developed  and  the  REF. 
The  Grid  API  allows  different  pre-  and  post-processors,  and  analysis  codes  to  be  easily  integrated  or 
replace  existing  codes,  without  affecting  other  codes  in  the  framework.  Grid  and  solution  data  can  be 
readily  retrieved  from  or  stored  in  the  common  database  via  the  API.  DT_NURBS[1],  a  B-spline  based 
geometry  library  from  the  David  Taylor  Model  Basin,  not  only  provides  a  robust  and  comprehensive  set 
of  geometry  manipulation  functions,  but  it  also  affords  an  excellent  method  for  representing  geometry, 
grid  data  and  associated  attributes,  and  other  information  required  by  the  various  types  of  analysis 
codes.  This  paper  provides  a  brief  overview  of  the  requirements  of  CEM  and  TM  grid  and  solution 
data,  functionality  of  the  REF  Grid  API  and  the  use  of  the  DT_NURBS  data  structure  for  accessing  and 
manipulating  the  mesh  data. 

Grid  Generation.  Grids  can  be  basically  divided  into  three  groups:  structured,  unstructured,  and 
hybrid. 

Structured,  Grids.  We  let  (x,y,z)  and  (^,7/,C)  denote  the  co-ordinates  in  the  physical  and  the  computa¬ 
tional  space  respectively;  then  the  grid  generation  process  involves  creating  a  network  of  lines  of  constant 
^,7?,  and  ^  such  that  a  one-to-one  mapping  can  be  established  between  physical  and  computational  space. 
The  physical  space  is  mapped  into  a  square  or  rectangle  in  two  dimensions,  and  into  a  cube  in  three 
dimensions,  as  in  Figure  1. 

Structured  grids  are  primarily  divided  into  two  groups,  Cartesian  and  curvilinear  grids,  which  are 
shown  in  Figure  2.  The  domain  of  Cartesian  grids  is  decomposed  by  vertical  and  horizontal  lines,  and 
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boundaries  are  approximated  by  tracing  where  the  grid  lines  meet  the  body.  The  main  drawback  of  this 
type  of  grid  is  that  the  application  of  boundary  conditions  is  difficult.  On  the  other  hand,  the  co-ordinate 
surfaces  of  curvilinear  grids  coincide  with  all  boundaries. 


4  i^<v) 


Figure  1:  Mapping  Computational  Space  (Left)  into  Physical  Space  (Right). 


Figure  2:  Cartesian  (Left)  and  Curvilinear  (Right)  Grids. 

Structured  grid  generation  methodologies  can  be  grouped  into  two  main  categories: 

•  Direct  methods,  where  algebraic  interpolation  techniques  are  utilized  -  Transfinite  interpolation, 
the  most  widely  used  algebraic  grid  generation  method,  is  based  on  Langrange  and  Hermite  inter¬ 
polation.  Recently  the  application  of  non-uniform  rational  B- Splines  (NURBS)  has  revolutionized 
direct  methods  of  generating  grids. 

•  Indirect  methods,  where  a  set  of  partial  differential  equations  are  solved  to  generate  the  grid  -  The 
Poisson  equation  is  solved  with  different  control  functions,  which  controls  the  grid  spacing  and 
smoothness  of  the  grids,  to  generate  the  grid. 

Unstructured  Grids.  Unstructured  grids  are  composed  of  triangular  cells  in  two  dimensions  and  tetrahe¬ 
dral  cells  in  three  dimensions,  as  shown  in  Figure  3.  The  grid  information  is  stored  in  a  set  of  co-ordinates 
called  nodes,  and  the  connectivity,  that  indicates  which  nodes  are  connected  to  the  cell.  The  advantage 
of  the  unstructured  grid  is  that  it  can  be  generated  around  complicated  shapes  with  ease. 

Hybrid  Grids.  Structured  and  unstructured  grids  may  be  combined  to  form  hybrid  grids,  in  which  part 
of  the  grid  is  structured  and  other  parts  unstructured  (Figure  3).  Hybrid  grids  can  be  generated  either 
by  creating  structured  grids  around  the  body  and  unstructured  grids  away  from  the  body,  or  by  creating 
structured  grids  around  different  components  and  then  connecting  them  by  using  unstructured  grids. 
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Figure  3:  Unstructured  (Left)  and  Hybrid  (Right)  Grids. 


In  most  practical  computations,  grid  points  are  clustered  near  the  region  where  there  is  a  large 
gradient  in  the  computed  variables. 

Requirements  of  CEM  Grid  and  Solution  Data.  The  two  dominant  themes  in  CEM  are  integral 
equations  and  differential  equations.  The  method  of  multigrids  is  becoming  increasingly  used  in  solving 
the  discretized  differential  equations  when  the  grids  are  regular,  as  in  the  case  of  the  finite  difference 
(FD)  method. [2] 

The  finite- difference  time-domain  (FDTD)  method  is  often  used  when  solving  transient,  or  broad¬ 
band  problems,  especially  those  associated  with  pulse  propagation  in  digital  and  microwave  circuits. 
The  FDTD  method  typically  uses  structured  grids.  However,  the  use  of  unstructured  grids  is  gaining 
popularity  with  the  new  generation  of  FDTD  codes. 

The  finite  element  method  (FEM)  [11],  [12],  although  applicable  to  time- domain  problems,  seems 
to  find  its  greatest  use  in  the  frequency  domain,  when  dealing  with  geometries  of  arbitrary  shape,  and 
steady-state  calculations  at  a  single  frequency.  Three-dimensional  FEM  codes  generally  use  unstructured 
grids. 

Structured  and  regular  grids  are  much  more  computationally  efficient,  since  all  arrays  are  directly 
addressed,  and  perform  more  efficiently  on  vector,  as  weU  as  RISC,  processors.  They  are  limited,  however, 
in  the  geometries  that  they  can  accurately  model. 

In  view  of  the  fact  that  the  method  of  differential  equations  is  used  in  the  majority  of  the  CEM  codes, 
which  includes  all  the  codes  thus  far  integrated  into  the  REF,  we  will  limit  our  discussions  to  the  FD  and 
FE  methods  in  this  section.  Furthermore,  this  allows  us  to  develop  a  grid  API  that  will  be  immediately 
useful  to  the  TM  community  within  MMACE,  because  all  such  TM  codes  are  of  the  FE  genre. 

In  spite  of  the  different  numerical  algorithms  employed  by  FD  and  FE  solvers,  the  solvers  in  both 
methods  operate  on  similar  sets  of  initial  data:  a  mesh.  A  mesh  consists  of  geometric  data  (nodes  and 
elements),  material  properties,  boundary  conditions,  and  all  other  information  required  to  describe  the 
physics  of  the  problem  to  be  solved.  In  addition,  control  information,  which  may  include  analysis  options, 
output  flags,  etc,  must  be  supplied  to  the  analysis  code.  In  some  FD  codes,  the  geometry  is  parametrized 
as  lines,  arcs,  surfaces,  etc,  and  is  read  in  as  part  of  the  input;  grid  generation  is  an  inherent  part  of  the 
analysis  code.  The  complexity  of  the  geometry  of  microwave  devices  reduces  the  usefulness  of  such  codes 
in  the  REF. 

Nodes  and  Elements.  Grid  data  can  be  divided  into  2  parts:  nodes  and  elements.  In  FD,  the  element 
connectivity  is  strictly  node-based;  i.e.,  each  element  is  completely  defined  by  a  list  of  node  numbers. 
For  example,  one  element  in  Figure  4  is  defined  by  the  set  of  nodes  {1, 2, 5, 4, 7, 8, 11, 10},  and  the  second 
element  by  {2,3,6,5,8,9,12,11}. 

In  the  finite- element  analysis  of  thermomechanical  problems,  we  usually  expand  the  unknown  fields 
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Figure  4:  3D  Nodal-Based  (Left)  and  Edge-Based  (Right)  Elements. 

using  scalar  interpolating  basis  functions,  with  the  unknown  expansion  coefRcients  being  three- component 
vectors;  indeed,  these  expansion  coefficients  are  usually  the  values  of  the  fields  at  the  nodes  of  the 
interpolating  functions,  which  are  the  nodes  of  the  grid.  This  is  satisfactory  if  the  fields  are  everywhere 
continuous,  such  as  the  displacement  field  in  a  problem  of  structural  analysis,  or  fluid  flow.  If,  however, 
there  are  sliding  interfaces  involved  in  the  problem,  then  the  Grid  API  must  identify  the  associated 
nodes,  so  that  the  analysis  code  can  treat  them  correctly. 

In  an  electromagnetics  problem,  however,  the  fields  of  interest  are  not  everywhere  continuous;  only  the 
tangential  components  of  the  electric  held,  E,  and  the  magnetic  held,  H,  are  continuous,  in  the  absence 
of  internal  singular  sources.  Hence,  the  elements  of  hnite-element  codes  in  CEM  are  not  node-based; 
they  are  are  edge-based  [2]- [4].  When  solving  an  electromagnetics  problem  using  edge-elements,  one 
uses  vector  interpolating  basis  functions  (the  edge- elements)  together  with  scalar  expansion  coefficients 
to  dehne  the  held.  The  scalar  expansion  coefficient  is  some  property  of  the  electromagnetic  held,  such 
as  the  normal  component  of  the  current  across  an  edge,  that  is  continuous,  and  this  is  the  reason 
such  functions  are  called  edge  elements.  In  volume-integral  equations,  one  uses  facet  elements  as  the 
three-dimensional  analog  of  edge  elements.  Facet  elements  are  vector  expansion  functions,  whose  scalar 
expansion  coefficients  are  the  normal  component  of  current  through  the  corresponding  face  (facet)  of  an 
element. 

Each  edge  element  is  dehned  by  the  edges  (edges  in  2D  and  facets  in  3D),  which  enclose  the 
area/ volume  of  the  element.  Thus  the  hrst  element  in  Figure  4  is  dehned  by  the  edge-connectivity, 

^  shown  in  the  hgure  on  the  right. 

The  use  of  edge- elements,  and  the  associated  edge-connectivity  description,  in  the  hnite-element 
method  in  CEM  is  numerically  superior  to  node-based  connectivity,  especially  in  dynamic  held  problems, 
for  tw'o  reasons:  (1)  when  solving  for  the  solenoidal  eigenmodes  of  a  waveguide,  a  node-based  hnite- 
element  analysis  with  a  penalty  method  will  still  give  spurious  solutions  at  discrete  frequencies;  (2) 
the  boundary  conditions  in  node-based  FEM  are  very  difficult  to  enforce.  Unless  special  conditions 
are  carefully  specihed  at  every  perfect  electric  and  magnetic  conductor  (PEC  and  PMC),  or  impedance 
boundary,  as  well  as  any  boundary  between  two  dissimilar  media,  it  is  easy  to  incur  errors  in  the  solution. 
In  contrast,  boundary  conditions  are  naturally  imposed  when  using  edge  elements,  and  little  additional 
effort  is  needed  for  their  specihcation. 

Material  Properties  and  Boundary  Conditions.  Material  properties  can  be  viewed  as  scalar  attributes  of 
the  elements.  These  attributes  are  real  numbers  for  time-domain  algorithms,  including  thermomechanical 
codes,  and  complex  numbers  for  frequency- domain  algorithms.  If  the  constitutive  relation  associated 
with  the  physics  of  the  problem  is  anisotropic  and  involves  tensor  coefficients,  the  symmetries  inherent 
in  the  constitutive  relation  will  reduce  the  number  of  independent  coefficients  in  the  constitutive  tensor. 
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and  these  coefHcieiits  can  be  derived  from  a  few  scalar  parameters^ .  Something  similar  is  required  when 
solving  electromagnetics  problems  on  conforming  grids  by  means  of  finite- difference  equations,  such  as  the 
FDTD.  In  the  curvilinear  coordinate  system  that  is  generated  to  conform  to  the  scatterer,  one  computes 
the  contravariant  component  of  the  electric  field  along  an  ‘electric  edge’  of  the  cell,  and  the  contravariant 
component  of  the  magnetic  field  along  a  ‘magnetic  edge’  of  the  dual,  or  overlapping  cell  [9].  One  needs 
the  covariant  components  of  the  electric  field  along  an  electric  edge,  however,  in  order  to  calculate  the 
electric  circulation  that  is  used  in  the  FDTD  algorithm.  The  transformation  from  contravariant  to 
covariant  components  of  vectors  requires  the  use  of  the  metric  tensor,  G,  associated  with  the  curvilinear 
coordinates.  Thus,  this  tensor  wiU  have  to  be  stored  as  input  data  with  the  grid.  Fortunately,  this  tensor 
is  symmetric,  which  means  that  only  six  independent  components  need  be  stored.  Furthermore,  one 
can  incorporate  permittivity  and  permeability  information  into  appropriately  weighted  metric  tensors, 
although  basis  vectors  are  needed  to  reconstruct  fields,  set  up  initial  conditions,  and  so  forth  [10]. 

When  solving  eddy-current  problems,  or  coupled  thermo-electromagnetic  problems,  or  problems  with 
nonlinear  materials,  it  is  necessary  to  include  data  that  define  the  conductivity,  permeability,  and/or 
permittivity  as  a  function  of  field  strength  or  temperature. 

Volume-integral  algorithms  in  CEM  utilize  volumetric  sources  as  dependent  variables  to  be  deter¬ 
mined.  Otherwise,  when  one  uses  FD  or  FE  techniques  in  CEM  the  problems  can  generally  be  consid¬ 
ered  to  be  completely  driven  by  boundary  conditions  (BC);  problems  with  volumetric  sources  are  rarely 
encountered. 

Boundary  conditions  can  be  associated  with  the  nodes  or  with  the  boundary  surfaces  or  edges.  For 
edge  elements,  the  BC  only  applies  to  the  edges.  Vector  quantities  can  be  easily  represented  in  their 
3-component  form:  2  tangential  and  1  normal,  although  only  the  tangential  components  of  the  E  and/or 
H  fields  have  to  be  specified  in  edge  elements. 

Closure.  EM  analyses  cannot  be  terminated  abruptly  at  the  external  boundary  of  the  physical  domain. 
It  is  necessary  to  approximate  the  interaction  of  the  system  with  its  environment.  However,  the  closure 
of  a  problem  cannot  be  easily  treated  as  a  simple  nodal  or  edge-based  attribute;  its  parametrization  is 
highly  mo  del- dependent. 

There  are  three  commonly  used  means  of  closing  the  analysis  domain.  The  first  is  to  use  approximate 
vector  absorbing  boundary  conditions  (ABC)  on  a  sphere  [6].  The  second  approach  uses  a  rigorous 
boundary  integral  operator,  that  is  defined  over  a  surface  of  revolution  that  contains  the  problem  domain 
within  its  interior  [7].  The  third  approach  uses  a  surface,  together  with  its  boundary  integral  operator, 
that  conforms  to  the  surface  of  the  domain  of  interest  [8].  In  using  this  approach,  one  can  take  advantage 
of  recent  developments  in  solid  geometry  that  use  NURBS  (non-unifrom  rational  B- splines)  [5]. 

Because  of  the  lack  of  consensus,  we  are  unable  to  derive  a  uniform  representation  of  the  data  structure 
for  closure.  As  a  result,  its  implementation  in  the  Grid  API  will  be  postponed. 

Solution  Data. 

For  problems  that  are  solved  in  the  frequency-domain,  the  E  (or  H)  field  solutions  will  consist  of 
complex  vectors.  Although  the  Grid  API  can  be  coded  to  process  these  data  as  complex  numbers, 
the  imaginary  part  of  the  data  wiU  be  treated  as  a  second  (real)  vector  solution,  in  order  to  maintain 
uniformity  between  CEM  and  TM  solution  data.  This  allows  a  tradeoff  between  transportability  and 
bookkeeping. 

Problems  that  are  solved  in  the  time-domain,  whether  via  the  finite- element  method  or  the  finite- 
difference  time-domain  (FDTD)  method,  will  have  real  vector  solutions  only,  as  in  TM  problems. 

Summary.  We  can  briefly  summarize  the  requirements  of  CEM  grid  and  solution  data: 

•  Nodal  definition  -  node  number,  (a:,y,z)  values,  attributes 

^An  example  is  that  only  two  Lame  constants  are  required  to  define  an  isotropic  constitutive  tensor  that  has  thirty-six 
coefficients  in  elasticity  theory. 


139 


•  Element  definitions  -  element  number,  nodal  connectivity,  attributes  (e.g.,  material  type) 

•  Edge  definitions  -  edge  number,  nodal  connectivity,  attributes 

•  Boundary  conditions 

1.  node-based  -  scalars,  vectors 

2.  edge-based  -  scalars,  vectors 

The  boundary  conditions  may  be  temperature-  and  time- dependent. 

•  Material  definitions  -  The  material  properties  may  be  temperature-  and  frequency- dependent,  etc. 

•  Solution  data 

1.  node- value  -  scalars,  vector 

2.  edge- value  -  scalars,  vectors  (real  -}-  imaginary  parts) 

3.  element-value  -  scalars,  vectors  (real  -f-  imaginary  parts) 

•  Control  information  -  analysis  options,  solver  type,  output  flag,  material  definitions,  and  all  other 

nongeometric  information. 

Requirements  of  Thermomechanical  Grid  and  Solution  Data.  There  is  only  one  dominant  theme 
in  computational  thermomechanics:  the  finite  element  method.  The  development  of  the  Grid  API  for 
TM  codes  is  further  simplified  by  the  fact  that  all  finite  elements  are  node-based.  Furthermore,  because 
one  rarely  uses  frequency-domain  methods  in  thermomechanical  models,  the  solution  data  will  consist  of 
real  numbers,  only. 

In  most  TM  analysis  codes,  three  types  of  elements  are  supported:  beam  (or  line),  shell  (or  surface) 
and  solid  elements.  The  beam  and  shell  elements,  also  known  as  structural  elements,  are  used  in  analyses 
where  the  aspect  ratios  between  the  3  dimensions  are  highly  skewed.  Their  use  is  neccesitated  by  the 
fact  that,  if  treated  as  regular  solid  elements,  these  elements  will  cause  the  system  matrix  to  be  very 
ill-conditioned,  and  the  computational  time  is  greatly  increased.  As  a  result  of  the  very  small  extent  in 
one  (for  shells)  or  two  (for  beams)  of  the  dimensions,  the  structural  elements  possess  additional  degrees 
of  freedom  -  twisting  and  bending  -  which  are  not  present  in  solid  elements. 

The  following  is  a  brief  summary  of  the  requirements  of  the  TM  grid  and  solution  data: 

•  Nodal  definitions  -  node  number,  {x,y,z)  values,  attributes  (e.g.,  nodal  constraints,  nodal  forces, 

etc), 

•  Element  definitions  -  element  number,  nodal  connectivity,  attributes 

•  Material  definitions  -  The  material  properties  may  be  temperature-dependent  and  anisotropic. 

•  Boundary  conditions 

1.  node-based  -  scalars,  vectors 

2.  surface-based  -  scalars,  vectors,  flux 

The  boundary  conditions  may  be  temperature-  and  time-dependent. 

•  Solution  data 

1.  nodes  -  scalars,  vector 

2.  beam  elements  -  scalars,  vectors,  tensors 

3.  surface  elements  -  scalars,  vectors,  tensors 

4.  solid  elements  -  scalars,  vectors,  tensors 

5.  Computed  degrees  of  freedom  associated  with  the  entire  model. 
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•  Control  information  -  analysis  options,  solver  type,  output  flag,  material  definitions,  and  all  other 
nongeometric  information. 


Functionality  of  the  Grid  API.  The  goals  of  the  Grid  API  are:  (1)  to  allow  software  developers 
to  develop  new,  or  wrap  existing,  software  to  be  integrated  into  the  REF;  and,  equally  important,  (2) 
to  facilitate  the  use  of  the  same  grid  data  in  multidisciplinary  analyses.  If  the  requirements  of  the 
CEM  and  TM  grid  and  solution  data  are  compared  strictly  in  terms  of  data  structure,  it  becomes  clear 
that  there  is  major  overlap  between  these  requirements,  in  spite  of  the  different  physics  involved.  The 
fundamental  information  to  be  processed  are  nodal  coordinates,  element  connectivities,  and  nodal  and 
elemental  attributes  which  may  be  scalars,  vectors,  or  tensors.  It  is,  therefore,  possible  to  design  the 
Grid  API  so  that,  with  a  few  exceptions,  most  TM  and  CEM  requirements  can  be  met  with  the  same 
functions. 

The  data  structure  implemented  in  the  geometry  library,  DTJ^URBS[1],  will  be  used  as  the  foun¬ 
dation  for  the  storing  and  retrieving  of  data  performed  by  the  Grid  API.  Its  well- designed,  dynamically 
managed,  memory  and  data  structure  will  accommodate  most,  if  not  all,  types  of  data  encountered  in 
computational  analyses.  This  includes  the  storage  of  grid  and  solution  data  as  “Grid,  Geometry  and 
Analysis”  (GGA)  entities,  and  the  simple  storage  of  arrays  of  numbers  and  characters.  The  use  of 
DT_NURBS  greatly  eases  the  burden  of  finding  a  common  interface  between  different  data  requirements 
in  multidisciplinary  analyses.  With  the  use  of  rational  and  non- rational  B- spline  functions,  multivariate 
functions,  which  can  be  geometric,  solution  and  field  data,  can  be  accurately  and  efficiently  interpolated. 
In  addition,  the  library  provides  many  sophisticated  mathematical  tools  to  manipulate  the  data.  For 
examples,  there  are  functions  to  calculate  the  intersection  of  surfaces  in  multi-dimensional  space,  and 
functions  to  extract  a  constant  parameter  curve  on  a  surface. 

The  following  is  a  summary  of  the  basic  functionalities  provided  by  the  Grid  API: 

1.  The  storage  and  retrieval  of  Cartesian  and  curvilinear,  structured  and  unstructured  grids,  and 
element  connectivity  -  The  coordinate  system  used  in  the  nodal  definition  can  be  either  Cartesian, 
cylindrical  or  spherical.  Hihger  order  elements,  such  as  those  with  mid-side  nodes,  can  be  processed 
too. 

2.  The  storage  and  retrieval  of  material  properties  used  in  the  analysis  -  The  material  properties  can 
have  any  number  of  functional  dependencies. 

3.  The  storage  and  retrieval  of  the  boundary  conditions  specified  in  the  analysis  -  This  includes 
pointers  to  other  large  array  data  used  as  boundary  conditions.  The  large  array  data  may  be  the 
temperature  distribution  calculated  by  a  thermal  code  which  is  then  used  to  drive  a  mechanical 
analysis  in  which  the  stresses  due  to  thermal  expansion  are  computed,  or  field  data. 

4.  Simple  data  analysis  capabilities  -  This  includes  the  computation  of  ceU  volumes,  the  normal  vector 
to  a  surface,  the  distance  of  a  point  from  a  surface,  etc. 

5.  Simple  browsing  capabilities  through  the  entire  grid  data  set. 

With  these  high-level,  machine-independent  functions,  grid  data  can  be  easily  managed  in  a  stan¬ 
dardized  and  highly  portable  manner  by  software  developers,  with  the  underlying  data  pipelines  and  any 
machine- dependency  of  the  data  being  the  responsibility  of  the  Research  and  Engineering  Framework. 
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ABSTRACT 


The  bcLsic  concepts  of  scientific  visualization  are  described:  visualization  itself,  associated  graph¬ 
ics  data,  and  the  application  programmer  interface  (API).  The  potential  role  of  standards  and 
the  extent  to  which  actual  standards  realize  this  potential  is  explored.  Although  existing  stan¬ 
dards  adequately  cover  low-level  graphics  data,  there  is  a  need  for  standardization  of  higher-level 
semantically  rich  data.  Standardization  of  APIs,  both  officially  and  de  facto,  has  been  an  ac¬ 
tive  field,  but  no  clear  leader  has  yet  emerged.  Visualization  tools  are  explained  within  this 
framework. 

KEYWORDS:  application  programmer  interface;  graphics  data;  graphics  standards;  visualiza¬ 
tion. 


1  Introduction 


The  Microwave/Millimeter  Wave  Advanced  Computational  Environment  (MM ACE)  Project 
must  support  powerful  and  intuitive  visualization  features  in  order  to  be  successful.  Provid¬ 
ing  these  facilities  is  a  real  technical  challenge  because  of  the  wide  range  of  disciplines  involved  in 
the  design  of  microwave  tubes:  manufacturing,  thermal  analysis,  and  simulation  of  electromag¬ 
netic  phenomena,  to  cite  just  a  few.  Clearly  MM  ACE  cannot  create  a  visualization  infrastructure 
from  scratch,  but  must  adopt  and  customize  existing  visualization  standards  and  conventions. 
The  exercise  of  applying  these  conventions  to  the  project  also  serves  to  highlight  gaps  in  the 
visualization  standards  currently  available. 


2  Visualization  Concepts 


Visualization  is  the  generation  of  a  visual  representation  of  some  sort  of  data.  Interactive  visual¬ 
ization  includes  the  ability  of  the  human  viewer,  in  real  time,  to  manipulate  the  representation 
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in  some  way  that  corresponds  to  the  properties  of  the  data  being  displayed.  The  data  to  be 
depicted  can  be  generated  in  many  ways,  e.g.  as  the  result  of: 

•  an  experimental  measurement, 

•  a  simulation  (or  modeling)  of  some  physical  phenomenon, 

•  a  mathematical  operation, 

•  the  design  of  an  artifact. 


Most  visualization  applications  fall  into  one  of  two  broad  categories:  realistic  or  conceptual.  In 
the  former,  the  intent  is  to  represent  some  physical  object  as  it  would  actually  appear  to  the 
human  eye.  Lighting  effects,  shading,  reflection,  shadows,  and  refraction  are  all  characteristic  of 
realistic  rendering.  Typical  applications  are  those  where  the  final  appearance  of  the  product  is 
of  interest,  such  as  automobile  body  design  and  architecture. 

The  purpose  of  conceptual  visualization,  on  the  other  hand,  is  to  convey  a  more  abstract  under¬ 
standing.  Typical  applications  might  be  a  blueprint  for  a  mechanical  part,  or  coloring  to  indicate 
the  value  of  some  variable,  such  as  temperature  in  a  weather  map.  The  objects  being  repre¬ 
sented  need  not  be  actually  visible,  nor  indeed  real  physical  objects  at  all,  as  would  be  the  case 
for  realistic  visualization.  Rather,  inherently  invisible  entities  such  as  mathematical  functions, 
isosurfaces,  and  wind-chill  factors  can  all  be  mapped  into  some  set  of  visual  shapes  and  features. 

Although  the  main  focus  of  this  paper  is  on  conceptual  visualization,  the  application  programmer 
interface  (API)  standards  discussed  herein,  such  as  PHIGS,  PEX  and  OpenGL,  also  have  many 
features  supporting  realistic  visualization. 

Visualization  functionality,  then,  consists  of  transforming  some  internal  representation  of  the 
objects  to  be  visualized  into  a  form  directly  processible  by  an  external  device  such  as  a  CRT 
or  plotter.  For  interactive  visualization,  external  user  events,  such  as  mouse  clicks,  must  be 
interpreted  as  commands  to  modify  the  way  in  which  the  object  is  depicted.  This  central  concept 
of  the  “graphics  pipeline”  is  useful  to  keep  in  mind  in  order  to  understand  how  the  various 
standards  work  together  to  support  visualization. 

The  role  of  standards  is  twofold:  1)  the  definition  of  intermediate  forms  of  representation  — 
visualization  data  —  within  the  transformation  from  internal  to  visual,  and  2)  the  definition  of 
APIs  to  allow  direct  control  of  the  visualization  process  by  platform-independent  software. 


3  Visualization  data 


There  are  a  great  many  graphics  data  formats,  far  more  than  can  be  discussed  here  in  any  detail. 
For  information  on  formats  mentioned  below,  see  recent  surveys  [KAY92],  [MURR94].  Also,  see 
the  world-wide  web  (WWW)  reference  list,  section  7. 
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Graphics  formats  can  be  categorized  according  to  two  criteria:  representation  level  and  geometric 
level,  as  explained  below. 

Visualization  data  with  a  high  representation  level  directly  describes  the  phenomenon  of  interest 
in  units  appropriate  to  the  problem  domain;  data  at  a  lower  representation  level  corresponds 
more  directly  to  the  appearance  of  the  computer  display.  The  three  representation  levels  we 
consider  are  modelling,  graphics,  and  device. 

The  modelling/graphics  distinction  indicates  whether  data  is  meant  to  describe  real-world  entities 
(properties  such  as  mass,  physical  size,  charge,  etc.)  or  to  describe  the  appearance  of  graphical 
entities  (dotted  line,  fonts,  etc.)  as  they  are  to  be  displayed.  Note  that  some  properties,  such  as 
geometry  and  color,  are  meaningful  in  both  realms  —  but  in  context  it  should  be  clear  how  they 
are  to  be  interpreted.  Of  course  modelling  data  is  not  inherently  graphical;  our  interest  here  is  in 
data  that  is  susceptible  to  visualization.  Modelling  data  as  such  does  not  describe  how  it  is  to  be 
displayed  e.g.  which  (graphical)  colors  to  associate  with  which  (real-world)  temperatures.  Finally, 
the  device  level  is  data  which  corresponds  closely  to  the  primitives  of  the  graphics  device;  this 
could  be  a  low-level  protocol  or  even  a  description  of  the  frame  buffer  contents,  which  determines 
how  screen  pixels  are  colored. 

Note  that  visualization  almost  always  involves  the  implicit  or  explicit  conversion  of  modelling 
data  to  device  data,  perhaps  via  the  intermediate  step  of  graphics  data.  Furthermore,  after  this 
conversion,  the  original  meaning  of  the  modelling  data  is  usually  not  preserved  in  the  lower  level 
format.  E.g.  a  vector  field  may  become  a  set  of  yellow  arrows,  which  in  turn  become  lighted 
pixels. 

The  geometric  level  of  data  simply  indicates  whether  the  basic  entities  represented  have  non¬ 
trivial  spatial  extension,  such  as  lines,  non-uniform  rational  B-spline  (NURB)  surfaces,  conic 
surfaces,  etc.  (often  called  vector  data)  or  are  merely  values  at  2D  or  3D  point  locations. 

To  sum  up,  with  examples: 


vector  data 

point  data 

modelling 

STEP 

HDF 

graphics 

CGM 

GIF 

device 

PEX 

bitblt  (bit  block  transfer) 

Figure  1:  Examples  of  Visualization  Data  Formats 

Among  the  criteria  to  keep  in  mind  when  evaluating  which  format  (s)  to  use  for  a  given  application 
are: 

•  functionality  (e.g.  support  for  3D,  splines) 

•  strength  of  standardization  (well-defined,  interpretation  available,  not  dependent  on  single 
vendor,  conformance  tests  available) 
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•  widespread  use  and  marketplace  support 

•  supporting  software  tools:  cost  and  power 

•  storage  efficiency 

•  performance  (speed)  potential 

3.1  Modelling  data 

Modelling  data  standards  specify  the  internal  representation  of  a  given  object  or  feature  in 
the  domain  of  interest,  for  instance,  how  to  represent  a  vector  field  internally  so  as  to  allow 
calculations  on  it.  There  seem  to  be  two  broad  areas  of  standardization  relevant  to  MM  ACE: 
one  for  manufacturing  and  design,  exemplified  by  IGES  and  STEP,  the  other  for  scientific  data, 
exemplified  by  HDF,  CDF,  and  NetCDF. 

In  the  MMACE  program,  standardization  of  modelling  data  is  addressed  by  the  common  data 
dictionary,  so  that  all  MMACE  applications  will  use  the  same  format  to  describe  objects  of 
interest.  How  this  common  data  (geometric  or  otherwise)  interfaces  with  the  available  standards 
is  a  significant  visualization  design  issue.  For  instance,  if  the  data  dictionary  specifies  the  use  of 
HDF,  MMACE  would  then  “inherit”  HDF-supporting  software,  but  also  be  limited  to  whatever 
HDF  can  express.  Of  course,  the  choice  of  modelling  data  format  is  a  very  basic  design  decision, 
with  implications  far  beyond  visualization  support.  For  instance,  the  availability  of  database 
support  for  a  given  format  would  be  a  major  criterion. 


3.2  Graphics  data 

Vector  data  (that  describing  extended  spatial  objects)  provides  a  somewhat  abstract  description 
of  the  scene  to  be  displayed.  It  often  includes  information  such  as  3D  shape  and  position  of 
points,  lines,  surfaces,  and  volumes,  intrinsic  color  of  objects,  width  and  style  of  lines,  size  and 
style  of  markers,  position  and  orientation  of  virtual  camera,  lighting  effects,  etc.  Among  the 
formats  in  common  use  are  CGM  and  DXF. 

Pixel-oriented  graphics  data,  also  called  bitmap  or  raster  data,  often  requires  a  lot  of  storage; 
associated  compression  techniques  are  therefore  important.  Generally  one  cannot  easily  recover 
higher- level  components  from  the  pixel  data  alone.  Most  of  these  formats  are  2D  only.  Among 
the  more  popular  are  GIF,  JPEG,  and  TIFF. 


3.3  Device  data 

Visualization  data  at  this  level  is  very  tightly  defined,  down  to  the  bit  level.  It  is  intended 
to  be  interpreted  either  directly  by  a  graphics  device  or  by  a  low-level  device  driver  or  server. 
Standardization  is  especially  important  here  because  it  allows  one  vendor’s  system  to  generate 
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graphic  data  that  may  be  displayed  on  that  of  another  over  a  network.  The  X  protocol  has  been 
widely  adopted  by  the  UNIX  workstation  world.  There  are  two  3D  extensions,  PEX  and  GLX, 
oriented  towards  the  PRIGS  and  OpenGL  APIs,  respectively. 


4  Application  Programmer  Interfaces 


Visualization  API  standards  specify  the  interface  between  application  programs  and  a  supporting 
graphics  library.  Typically,  they  have  features  to  support  automatic  conversion  of  data  from 
modelling  to  screen  coordinates,  perspective  views,  transformations,  lighting  effects,  and  data 
visualization.  Also,  the  program  should  be  able  to  control  and  interpret  input  from  the  user  in 
terms  of  the  model. 

Thus,  a  program  written  using  an  API  effectively  performs  the  conversion  of  modelling  data  to 
device  data,  obviating  the  need  to  capture  the  lower  level  data  explicitly.  Furthermore,  since  the 
program  is  in  control  of  the  whole  graphics  pipeline  at  once,  it  can  support  richer  interactive 
visualization,  since  the  program  “understands”  the  meaning  of  the  object  being  displayed,  and 
thus  can  interpret  user  input  in  terms  of  that  object.  By  contrast,  a  generic  visualizer  which 
accepts  graphics  data  typically  cannot  do  more  than  perform  generic  geometric  (e.g.  rotation) 
and  graphical  (e.g.  modify  color)  operations  on  the  displayed  object. 

We  will  consider  here  only  3D  graphic  APIs  which  have  multi-vendor  support.  The  three  major 
interfaces  meeting  these  criteria  are  OpenGL  [OGL93],  PEXlib  [PEX92],  and  PRIGS  [PHIGS89]. 
Again,  a  detailed  comparison  is  beyond  the  scope  of  this  paper,  but  a  brief  survey  of  features  is 
presented  in  the  table  below.  All  these  APIs  are  generic  graphic  libraries;  they  are  not  oriented 
towards  any  particular  domain.  If  one  needed  to  support  a  large  application,  it  might  well  be 
advisable  to  build  a  higher-level  utility  API  on  top  of  one  or  more  of  these,  first  to  shield  the 
application  programmer  from  some  of  the  lower-level  details  of  graphics  rendering,  and  second 
to  build  in  support  for  the  types  of  objects  to  be  manipulated  in  the  domain  of  interest. 


5  Visualization  Tools 


Visualization  standards  support  not  only  the  programming  of  specific  applications  by  the  eventual 
end-user,  but  also  the  development  of  portable  general-purpose  tools,  some  of  which  may  be  made 
freely  available.  Most  such  tools  accept  data  at  the  graphics  or  device  level  and  generate  a  display 
as  a  result.  The  higher  level  the  data,  the  higher  level  operations  that  may  be  available.  For 
instance,  if  the  tool  starts  with  3D  graphics  data  it  may  well  allow  the  user  to  rotate  selected 
objects  around  a  specified  axis,  move  them  relative  to  each  other,  or  view  them  from  various 
perspectives.  If  the  tool  accepts  only  2D  pixel  data,  such  functionality  is  impractical;  the  tool 
is  likely  to  allow  only  simple  operations  such  as  cropping  and  scaling.  At  the  other  extreme,  a 
“domain-savvy”  tool  would  be  able  to  perform  operations  that  were  meaningful  in  the  semantic 
domain  of  the  application.  For  instance,  a  tool  that  visualizes  an  EM  vector  field  might  allow 
the  adjustment  of  the  position  and  strength  of  field  sources. 
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Figure  2:  Visualization  API  criteria 

The  functionality  of  a  tool  therefore  can  be  pretty  well  summarized  by  asking: 

•  What  level  of  visualization  data  does  it  accept,  and  which  formats  in  particular? 

•  What  operations  does  it  allow? 

•  What  level  of  visualization  data  does  it  generate,  and  which  formats  in  particular? 

•  What  environment  does  it  run  under? 


For  instance,  xv  may  be  characterized  as  a  tool  that  accepts  graphics-level  pixel  data  (GIF, 
TIFF,  JPEG,  etc.),  supports  operations  such  as  sizing,  cropping,  and  color  editing,  generates 
graphics-level  pixel  data  and  device  level  data  (i.e.  draws  picture),  and  works  in  the  UNIX/X 
environment. 


6  Conclusion 


Strong  visualization  standards  can  promote  the  development  of  portable  graphics  software  tools, 
and  facilitate  the  exchange  of  graphics  data  across  different  vendors’  platforms,  and  among  various 
applications.  In  the  past  standards  usually  originated  either  in  “official”  standards  committees, 
such  as  those  operating  under  the  American  National  Standards  Institute  (ANSI)  or  International 
Standards  Organization  (ISO)  umbrella,  or  from  a  single  dominant  vendor.  It  is  becoming  more 
common,  however,  for  standards  to  be  promulgated  by  consortia  of  vendors  and  other  interested 
parties,  such  as  the  X  Consortium  and  OpenGL  Architecture  Review  Board.  The  consortium 
method  often  allows  greater  speed  and  flexibility  in  the  formulation  of  standards,  but  it  becomes 
more  difficult  for  the  user  to  decide  among  competing  standards  in  the  same  application  area. 
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7  World-wide  web  (WWW)  Reference  List 


The  following  Universal  Resource  Locators  (URLs)  contain  information  pertinent  to  this  article. 

MMACE  project: 

http://mmace.nrl.navy.mil:8080/ 

Computer  Graphics  and  Visualization: 

http:/ /www.dat  aspace.com/WWW/vlib/comp-gr  aphics.html 
http:  /  /  www.khoros.unm.edu/pointers  /  visualization.html 

Scientific  data  formats: 

ftp://rtfm.mit.edu/pub/usenet/news.answers/sci-data-formats  or 
http://fits.cv.nrao.edu/trafRc/scidataformats/faq.html 

Graphics  data  formats: 

ftp:/ /ftp.  ncsa.uiuc.edu/misc/file.format  s/graphics,  formats 
ftp://zamenhof.cs.rice.edu/pub/graphics.formats 
http:// gopher.ora.com:  70/ 1  /  descriptions/soft /gff 

X  consortium  specifications,  such  as  X,  PEX,  and  PEXlib: 
http://ftp.x.org 

Hierarchical  Data  Format  (HDF): 

http://www.ncsa.uiuc.edu/SDG/Software/HDF/HDFIntro.html 

NetCDF: 

http://www.unidata.ucar.edu/packages/netcdf 
PHIGS  Validation  Tests: 

http://speckle.ncsl.nist.gov/~cugini/pvt/hy-top.html 

Information  on  OpenGL: 

http://www.sgi.com/tech/openGL/opengl.html 
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Visualization  Toolkit  for  Computational  Electromagnetics 


R.  A.  Joseph 

ARCON  Corporation 
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Waltham,  MA  02154 

1.  Abstract. 

This  paper  is  a  brief  description  of  a  visualization  system  under  development  for  use  with 
electromagnetic  simulation  computer  programs [1]. 

The  visualization  software  is  designed  to  be  a  generic  package  capable  of  operating  with  sev¬ 
eral  Electromagnetic  (EM)  codes  and  various  hardware  platforms.  Geometric  models  of  irradiated 
and/or  radiating  objects  are  obtained  using  CAD  software.  These  models  are  translated  into  data 
files  appropriately  formatted  as  input  to  the  EM  code.  The  output  of  the  EM  computation  is  then 
processed  by  the  visualization  system  to  display  such  quantities  as  surface  currents,  antenna  radi¬ 
ation  patterns  and  EM  fields.  A  platform  independent  language,  OpenGL,  is  used  to  visualize  inter¬ 
mediate  and  final  results.  Execution  of  the  codes  is  under  the  control  of  a  graphical  user  interface. 

2.  Introduction. 

Computer  programs  for  electromagnetic  analysis  belong  to  a  large  class  of  simulation  pro¬ 
grams  that  were  developed  to  model  physical  systems.  These  simulation  programs  provide  valuable, 
often  indispensable,  assistance  to  engineers  and  scientists  and  are  widely  accepted  and  applied  in  re¬ 
search,  development  and  product  design.  EM  analysis  and  other  physical  simulation  programs  pro¬ 
vide  an  ideal  client  base  for  the  application  of  modem  graphics  and  visualization  techniques. 
Because  many  of  these  simulation  programs  were  written  before  graphics  workstations  and  high¬ 
speed  microprocessors  became  available,  there  are  many  instances  in  which  the  formats  of  the  input 
model  data  and  output  information  can  be  substantially  modified  and  improved  with  visualization 
software. 

GEMACS  (General  Electromagnetic  Model  for  the  Analysis  of  Complex  Systems)[2]  is  a 
widely  used  (distributed  to  over  50  Government  and  commercial  organizations)  modeling  code  used 
to  analyze  the  behavior  and  performance  of  electromagnetic  systems.  GEMACS  is  written  in 
FORTRAN-77  and  has  the  advantage  of  running  on  several  different  platforms,  including  main¬ 
frames,  workstations  and  IBM-PC  486  compatible  computers.  Although  GEMACS  modeling  capa¬ 
bility  and  the  depth  and  variety  of  information  provided  by  the  electromagnetic  analysis  are 
considerable,  running  the  code  requires  a  great  deal  of  work  creating  a  representation  of  the  input 
geometry  in  terms  of  GEMACS  primitives.  In  addition,  the  run  results  are  usually  expressed  as  ta¬ 
bles  of  numbers  which  convey  far  less  than  a  graphical  presentation. 

We  have  begun  a  project  to  enhance  the  utility  of  GEMACS  by  building  on  to  it  an  interface 
which  will  translate  output  from  any  standard  CAD  program  into  a  suitable  format.  This  interface 
will  also  allow  the  convenient  visualization  of  output  from  a  run  of  the  GEMACS  code  and  will 
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include  a  Graphical  User  Interface  allowing  the  user  to  issue  instructions  to  the  program  via  a  point¬ 
ing  device. 

3.  Development. 

Our  development  work  has  been  done  primarily  on  a  Silicon  Graphics  Indigo^  Workstation 
featuring  a  MIPS  R4400  150MHz  processor,  64  MB  memory  and  IRIX  Version  5.2. 

All  of  our  code  is  written  in  ANSI  C,  with  function  calls  to  the  X  Window  system  through 
OSF/Motif.  Our  graphics  programming  was  done  with  function  calls  to  OpenGL[3].  OpenGL  is  a 
hardware-independent  graphics  language  originally  available  only  on  Silicon  Graphics  Workstations, 
but  now  implemented  on  nearly  ail  workstations,  including  DEC,  HP  and  Sun.  It  is  supplied  with 
the  Silicon  Graphics  operating  system,  IRIX  version  5.2,  or  later.  It  has  also  been  adapted  for  use  in 
Windows  NT  for  IBM-PC  compatibles  and  will  be  used  in  Windows  95  when  it  becomes  available. 
Programming  with  OpenGL  thus  allows  us  the  ability  to  quickly  adapt  our  program  to  multiple 
platforms. 

The  CAD  package  used  for  this  project  was  ACAD75J[4],  a  CAD  package  available  to 
members  of  the  Electromagnetic  Code  Consortium.  All  that  was  required  of  the  CAD  package  was 
the  ability  to  output  geometry  information  in  terms  of  IGES[5]  (Initial  Graphics  Exchange  Specifica¬ 
tion).  Nearly  all  modern  CAD  packages  are  able  to  do  this.  Thus,  the  requirement  for  the  interface 
is  reduced  to  the  ability  to  translate  from  IGES  graphical  entities  to  GEM  ACS  primitives. 

Some  of  the  most  important  IGES  entities  used  are  Non-Uniform  Rational  B-Spline 
(NURBS) [6]  curves  and  surfaces.  These  occur  naturally  in  CAD  applications  since  they  allow  for 
the  compact  encoding  of  complicated  curves  and  surfaces.  Surfaces  in  GEMACS  are  not  represent¬ 
able  by  NURBS  entities,  but  rather  as  combinations  of  facets. 

When  we  first  embarked  on  this  project,  we  had  planned  to  integrate  DT-NURBS[7]  into  our 
program  to  handle  NURBS  entities.  We  found,  however,  that  one  of  the  advantages  of  OpenGL  is 
that  it  allows  the  direct  interpretation  of  NURBS  curves  and  surfaces. 

In  addition,  OpenGL  can  also  be  used,  when  operating  in  feedback  mode,  to  translate 
NURBS  entities  into  a  series  of  vertices  which  can  then  be  used  to  generate  the  geometry  section  of 
a  GEMACS  command  file  consisting  of  plate  surface  descriptions.  Although  it  is  totally  transparent 
to  the  user,  the  model  image  is  actually  drawn  twice.  The  first  time,  it  appears  on  the  screen.  The 
next  time  it  is  drawn,  the  information  that  would  normally  go  to  the  screen  is  instead  sent  to  an 
array,  which  is  then  written  to  a  temporary  file.  It  is  the  information  in  this  file  that  is  used  to  gen¬ 
erate  the  command  file  for  the  EM  simulation  program. 

4.  Program  Function. 

Figure  la  shows  the  program  screen  after  the  import  of  a  geometric  figure  which  was 
created  by  ACAD75J  and  stored  in  the  IGES  format.  The  menu  bar  at  the  top  offers  four  selections: 
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1 .  File,  offers  two  options: 

a)  Open:  allows  the  user  the  option  to  import  a  previously  created  IGES  file. 

b)  Quit:  to  end  the  program. 

2.  Launch,  offers  four  options: 

a)  ACAD:  shells  to  the  CAD  program. 

b)  Cull:  a  program  which  will  reduce  the  number  of  facets  through  combination  of 
neighboring  facets. 

c)  GEMACS:  creates  a  GEMACS  command  file,  and  then  shells  to  GEMACS. 

d)  MATLAB:  strips  and  reformats  the  GEMACS  output  files,  then  calls  Matlab  to  visualize 
the  results. 

3.  View,  offers  two  options: 

a)  A  toggle  which  will  facetize  the  solid  surface,  as  shown  in  Figure  lb. 

b)  View:  offers  the  user  the  ability  to  change  viewing  angles. 

4.  Help. 

The  model  shown  in  Figure  1  was  drawn  using  ACAD  specifically  for  use  during  the  devel¬ 
opment  phases  of  our  project.  The  figure  is  a  simple  two-dimensional  surface,  but  the  presence  of 


(a)  (b) 


Figure  1.  EMViz  home  window. 

the  two  curved  vertical  edges  causes  the  CAD  program  to  code  the  surface  as  a  NURBS  surface 
within  the  IGES  file.  One  of  our  primary  objectives  was  to  provide  a  method  for  the  translation  of 
NURBS  entities  into  a  format  compatible  with  GEMACS  command  files. 

5.  Future  Plans. 

The  program  screen  shown  in  Figure  1,  EMViz,  is  a  version  developed  specifically  to  work 
with  GEMACS.  At  this  stage  it  is  able  to  translate  only  a  limited  number  of  the  100  different  types 
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of  IGES  entity  types.  In  addition,  the  code  does  not  take  advantage  of  all  the  GEMACS  geometry 
primitives  available.  In  addition  to  filling  in  these  gaps,  we  will  be  extending  the  User  Interface  to 
allow  the  user  to  have  more  control  over  GEMACS  execution. 

We  also  plan  to  continue  the  development  of  our  program  with  the  inclusion  of  other  electro¬ 
magnetic  modeling  programs  such  as  GRECO[8],  and  NEC [9],  and  also  other  physical  modeling  sit¬ 
uations  such  as  are  encountered  in  mechanical  engineering  (heat  transfer,  statics),  environmental 
engineering  (fluid  flow,  particle  diffusion),  and  electronic  engineering  (charged  particle  transport) 
applications,  to  name  a  few.  We  also  plan  to  port  our  code  to  all  types  of  workstations,  and  also  to 
the  PC  running  WindowsNT  or  Windows  95. 

The  result  of  this  project  will  be  a  product  called  ArconViz. 
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1.0  Introduction 

This  paper  reviews  the  development  of  the  MMACE  (Microwave  and  Millimeter-wave  Advanced 
Computational  Environment)  using  the  REF  (Research  Engineering  Framework).  MMACE  is  a 
specific  usage  of  the  generic  REF  framework  for  the  microwave  tube  industry  in  which  the  design 
environment  is  being  specifically  developed  in  support  of  power  tube  design.  This  paper  therefore 
draws  upon  experiences  gained  during  the  interface  of  microwave  tube  design  codes  with  the  REF. 

Although  the  examples  used  in  this  paper  refer  to  the  microwave  tube  code  experience,  the  discussion 
will  generalize  the  procedures  followed  and  the  lessons  learned.  It  is  the  intent  of  this  paper  to  address 
the  scientific  code  interface  process  from  the  point  of  view  of  a  neophyte  who  wishes  to  insert  an 
analysis  code  into  the  REF. 

Specifically,  the  following  issues  will  be  addressed:  (1)  the  characteristics  of  codes  which  are  readily 
inserted  into  REF,  (2)  the  control  panel  insertion  procedure,  (3)  the  user-interface  wrappering 
procedure  for  codes,  (4)  the  data  connectivity  procedure  for  codes,  and  (5)  the  data  modeling  and  data 
dictionary  tasks  which  ensure  re-usability  of  data  within  REF.  Future  directions  of  REF  plug-and-play 
activities  will  be  discussed  and  the  advantages  and  disadvantages  of  these  directions  will  be 
highlighted. 


2.0  MMACE  Phase  1  Prototype 

During  Phase  1  of  the  MMACE  Program,  a  prototype  software  environment  was  produced.  During  the 
period  of  the  development  of  this  prototype,  the  MMACE  program  concentrated  on  developing  a 
detailed  understanding  of  the  computational  requirements  for  advanced  microwave  power  tube  design. 
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Several  of  the  key  concepts  which  were  identified  during  this  18-month  period  have  been  implemented 
in  this  prototype  design  system  which  includes  a  subset  of  existing  tube  industry  design  codes.  It  was  a 
key  objective  of  Phase  1  of  the  MMACE  program  that  this  prototype  provide  a  useful  design  capability 
to  the  tube  industry  and  it  was  on  that  basis  that  the  subset  of  codes  was  selected.  Specifically,  the 
selected  subset  of  codes  provides  a  design  capability  through  to  the  PIC  (particle-in-cell)  codes  and 
includes  a  thennomechanical  analysis  capability. 

The  MMACE  prototype  also  focused  on  providing  a  first  cut  at  design  data  management  and 
integration  whilst  providing  a  consistent  and  intuitive  interface  to  the  user.  A  master  geometry  data 
model,  a  first  generation  of  wrapper  technology  and  a  control  panel  (using  a  design  flow  paradigm) 
were  also  provided.  In  addition,  the  prototype  was  an  ideal  vehicle  to  provide  the  integrators  with  an 
opportunity  to  gain  experience  with  the  tube  codes,  the  tube  industry  itself  and  the  issues  particular  to 
the  high  frequency  electromagnetics  arena.  Further,  during  the  development  of  the  prototype,  the 
difficulty  of  a  variety  of  implementation  issues  were  evaluated,  tested  and  invaluable  lessons  learnt  for 
future  development.  The  future  development  of  REF  will  benefit  significantly  from  the  leveraged  use 
of  this  prototype  experience. 

3,0  Future  REF  Development 


The  REF  of  the  future  will  be  a  complete  graphically  oriented  computational  environment.  It  will  be  a 
geometry-based,  user-oriented,  data-driven  and  team  engineering  managed  environment  for  high 
frequency  device  design.  This  environment  will  be  an  enabling  technology  for  improved 
manufacturing,  particularly  with  regard  to  the  cost  and  quality  issues  so  essential  to  the  bottom  line  of 
a  manufacturing  organization. 

In  order  to  achieve  the  above  objectives,  the  following  are  goals  of  the  REF  environment  development 
effort  during  the  MMACE  program: 

(a)  the  user  will  be  enabled  so  that  he/she  can  add/enhance/delete  codes  in  the  REF  environment; 

(b)  a  comprehensive  parametric  design  data  model  will  be  developed  complete  with  a  user- 
extensible  data  dictionary. 

(c)  a  comprehensive  geometry  data  model  to  support  the  use  of  2D  and  3D  tools.  The  geometry 
socket  approach  available  in  the  prototype  will  continue  into  the  full  REF  environment  and 
thereby  permit  the  connection  of  a  variety  of  front-end  CAD  tools.  Selection  of  the  appropriate 
CAD  tools  will  provide  connectivity  into  the  CAD/CAM  world; 

(d)  the  complete  REF  environment  will  enable  the  user  to  move  readily  up  and  down  the  data 
abstraction  pyramid,  provide  for  different  views  of  the  same  data  and  provide  a  consistent  set 
of  visualization  tools; 

(e)  a  graphical  template  builder  for  the  wrapper  technology  rather  than  the  text  based  builder 
available  in  the  prototype.  The  graphical  approach  will  reduce  the  development  and 
maintenance  time  for  the  wrapper  technology. 
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(f)  full  design  data  management  in  order  to  provide  the  right  data  to  the  right  person  at  the  right 
time.  The  data  management  approach  will  need  to  capture  all  data  relevant  to  geometry-based 
design  disciplines  including  parametric,  geometry,  array  data,  grids,  test,  specifications  and 
materials  data. 

(g)  a  user-configurable  graphically-driven  control  panel  which  through  its  intuitive  look  and  feel 
will  shorten  the  user  learning  curve  and  reduce  errors. 

(h)  the  REF  environment  will  need  to  provide  various  facilities  including  state-of-the-art 
connectivity  and  openness  as  well  as  data  backup  and  recovery. 

By  providing  such  facilities,  REF  will  provide  a  positive  improvement  to  the  design  and  manufacturing 
process.  The  following  table  captures  some  of  the  comparative  aspects  of  a  pre-REF  and  a  post-REF 
scenario. 


ITEM 

Legacy  design  data 

Disorganized,  n on-electronic 

Organized,  accessible,  electronic 
database 

Parts  re-use 

Not  common 

Enabling,  standard  parts  library 
complete  with  test  data 

Use  of  new  codes 

Not  easily  connectable  to 
existing  data,  difficult  to  validate 

Straightforward  to  connect  to 
REF  compliant  data;  readily 
validate  the  new  code  with 
existing  data 

Alternative  design  scenarios 

Difficult  to  add  alternative 
design  scenarios 

Straightforward  to  add  an 
alternative  design  to  complement 
main-stream  design 

Connection  to  drafting  and 

machine  shop 

Not  electronic,  slow,  possibility 
of  error 

Electronic,  rapid,  minimization 
of  errors 

Manufacturing 

Afterthought  to  design  process 

Design  team 

Single  local  location 

Distributed  computing/ 
distributed  design 

Table  1:  Pre-  and  Post-REF  Comparison  Matrix 


4.0  Lessons  Learned 


4.1  Characteristics  of  Codes  to  Facilitate  Easy  Insertion  into  REF 

It  is  important  to  determine  what  “characteristics”  of  codes  define  the  relative  ease  or  difficulty  of 
insertion  of  such  codes  into  the  REF  framework.  Specifically,  the  term  “characteristics”  of  codes 
means  the  syntax  and  signatures  of  the  data  I/O  stream  and  the  sequence  by  which  this  data  is  input  and 
output  by  the  code.  The  paradigm  for  which  REF  is  geared  towards  are  codes  which  are  batch 
oriented,  i.e.  there  is  very  little  interactive  user  input  once  the  code  has  been  given  its  set  of  data 
inputs.  These  codes  then  run  using  this  data  input  and  produce  the  computed  data  output.  Whether  the 
code  takes  a  short  or  a  long  time  to  complete  its  execution  is  not  relevant;  what  is  relevant  is  that  all 
data  necessary  for  the  code  to  run  through  to  completion  must  be  present  prior  to  starting  the  code. 

In  order  for  relatively  easy  insertion  of  a  code  to  occur,  the  syntax  of  the  file  input  and  output  of  the 
code  must  be  fairly  straightforward.  Straightforward  in  this  context  means  that  the  syntax  should  not 
contain  significant  amounts  of  branching  of  syntax,  branching  in  the  sense  that  different  sequencing  of 
syntax  occurs  depending  on  the  value  of  data  in  prior  lines.  Whilst  complex  branching  can  be  added  to 
REF  as  REF  evolves,  there  becomes  a  limit  in  scope  of  benefits  versus  expended  effort  to  incorporate 
flexibility  for  extraordinarily  complex  data  syntax. 

4.2  REF  Control  Panel 


The  control  panel  is  the  first  and  most  important  interface  which  a  REF  user  will  see  (see  Figure  1). 

Consequently,  it  is  important  that  this  screen  provide  an  extremely  attractive  flexibility.  Specifically, 

the  user  expects  the  following  functionality: 

•  The  ability  to  configure  the  flow  of  design  tools  in  the  sequence  that  is  appropriate  for  the  design 
that  is  being  contemplated; 

•  The  ability  to  add  and  remove  design  tools  as  required; 

•  The  ability  to  invoke  core  system  utilities  such  as  project  version  control,  visualization  tools, 
project  load  and  save  commands,  etc.; 

•  The  ability  to  provide  help.  During  the  prototype  phase  of  MM  ACE,  it  was  established  that  the 
Mosaic  system  is  the  ideal  candidate  for  the  provision  of  help  and  as  such  all  documentation  which 
is  to  be  used  for  REF  should  be  configured  to  run  within  the  MOSAIC  system. 

In  order  to  connect  a  new  code  to  the  REF  control  panel,  it  is  important  that  the  following  information 

be  available: 

•  the  pathname  to  the  executable  of  the  code; 

•  the  name  of  the  icon  which  will  be  used  to  represent  the  code  on  the  control  panel;  and 
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•  the  pathname  to  the  templates  which  are  used  to  wrapper  the  code  to  the  REF. 
REF  will  provide  this  capability  shortly. 
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Figure  1:  REF  Prototype  Control  Panel 
4.3  User  Interface  Wrappering 

The  process  of  user  interface  wrappering  (see  Figure  2)  is  composed  of  a  sequence  of  steps  which 
provide  two  separate  but  related  pieces  of  functionality  when  wrappering  a  native  code.  Firstly,  the  VO 
data  needs  of  the  native  code  need  to  be  connected  to  the  central  REF  database  mechanism.  Secondly, 
the  user  needs  to  be  able  to  display  the  data  being  input  and  output  for  verification  and  to  also  provide 
the  user  with  the  opportunity  to  edit  the  data,  as  required. 

In  order  to  provide  a  user-interface  to  the  VO  data  entities  which  are  required  by  the  code,  the  process 
consists  of  constructing  a  template  technology  in  which  the  user  can  log  an  inventory  of  all  of  the  data 
entities  which  need  to  be  viewed  or  edited.  In  order  to  display  each  of  these  data  entities,  the  user  must 
provide  in  the  relevant  template,  the  location  of  the  entity  in  the  database,  the  prompt  to  be  displayed 
in  conjunction  with  the  value  of  the  data  entity  and  any  other  useful  information  such  as  the  units  in 
which  the  data  entity  will  be  displayed. 
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The  second  piece  of  the  wrapper  functionality  is  to  extract  the  data  from  the  database  and  provide  it  in 
the  sequence  and  syntax  of  the  input  file,  as  required  by  the  native  code  being  wrapped.  A  similar 
capability  is  required  for  the  output  from  the  code.  This  requires  a  second  piece  of  technology  whose 
function  is  to  provide  for  each  data  item,  its  location  in  the  REF  database  and  the  syntax  in  which  each 
of  the  data  items  should  be  presented.  Clearly,  as  REF  evolves,  the  allowable  syntax  will  expand  to 
address  more  types  of  file  syntax.  Currently,  the  following  styles  of  syntax  are  covered:  (1)  order- 
sensitive  files,  in  which  there  is  a  strict  order  in  which  the  data  is  required  in  the  file  and  for  which 
there  are  no  significant  branches  in  the  order  of  data;  (2)  namelist-style  files  for  which  order  is  not 
important  but  each  data  item  must  be  accompanied  by  a  keyword  which  identifies  to  the  native  code 
the  identity  of  the  data  item  to  which  the  value  belongs. 

The  above  discussion  has  focused  on  parametric  data.  REF  must  also  give  consideration  to  geometric 
data  for  which  several  “firmware”  implementations  have  been  generated  for  a  variety  of  tube  codes  in 
the  REF  prototype.  “Firmware”  is  used  in  the  sense  that  the  code  can  be  easily  customized  by  the 
developers  of  REF  but  it  is  much  more  difficult  for  users  to  make  said  alterations.  It  remains  one  of 
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the  challenges  of  REF  to  generate  a  relatively  generic  approach  to  the  geometric  data  connection  of 
native  codes  via  a  wrapper  to  the  database.  It  is  expected  that  no  matter  what  the  solution  is  for  this 
issue  that  there  will  be  a  requirement  that  users  make  some  minor  modifications  to  their  native  codes 
in  the  area  of  geometry  input. 

The  current  MMACE  prototype  has  focused  exclusively  on  2D  geometry.  Clearly,  this  effort  needs  to 
extended  to  provide  a  3D  capability.  In  addition,  the  future  of  geometry  representations  is  moving  to 
NURBS  and  so  the  REF  must  have  in  its  plans  a  means  for  this  transition  to  be  effected.  Specifically, 
during  the  transition  from  geometry  representation  using  lines  and  arcs  to  that  using  NURBS,  there 
will  be  a  valid  mixture  of  codes,  some  of  which  will  require  NURBS  and  some  of  which  will  expect 
lines  and  arcs.  In  order  to  provide  a  useful  framework,  it  is  necessary  for  REF  to  deal  transparently 
with  the  mixture  of  types  of  geometry  data  availability  and  requirements.  This  can  be  most 
conveniently  done  by  using  a  bi-directional  translator  which  will  be  activated  when  a  mismatch 
between  geometry  data  needs  and  geometry  data  availability  is  detected. 

4.4  Data  Modeling  and  Data  Dictionary 

The  data  model  developed  under  the  MMACE  Phase  1  prototype  activities  was  a  limited  data  model 
which  offered  the  following  features.  Interwoven  in  each  discussion  is  the  lessons  learned  and  the 
approach  to  be  pursued  for  the  next  generation  of  REF: 

•  The  ability  to  separate  generic  and  tool-specific  data.  By  tool-specific  data,  it  is  meant  data  which 
is  needed  specifically  for  that  particular  tool’s  usage  and  has  no  significance  to  the  actual  solution 
of  the  generic  simulation  being  performed  by  the  particular  tool.  This  concept  will  be  maintained 
in  future  generations  of  REF. 

•  Objects  of  data  are  recognized  by  type  name.  For  example,  a  pole  piece  in  a  magnetic  analysis 
solution  would  be  defined  in  the  CAD  drawing  as  having  a  type  of  pole„piece  and  so  all  magnetic 
tools  would  need  to  identify  all  such  objects  with  this  type.  Consequently,  in  a  design  where  there 
are  multiple  pole  pieces,  all  of  these  would  need  a  unique  name  but  all  would  be  of  type 
pole_piece.  This  is  clearly  one  of  the  more  inflexible  pieces  of  the  current  prototype  environment. 
In  order  for  tools  to  connect  to  provide  simulations,  it  is  necessary  for  each  of  them  to  have  an 
inventory  of  all  of  the  types  of  objects  which  are  relevant  to  their  simulation  arena.  For  example,  a 
static  magnetic  analysis  tool  will  need  to  know  all  types  of  materials  which  have  a  permeability 
issue.  This  is  clearly  not  in  the  interests  of  flexibility.  What  is  more  appropriate  is  that  a  tiered 
data  model  be  constructed  which  provides  one  view  for  the  user,  namely  types  such  as  pole_pieces, 
permanent_magnets,  etc.  for  which  a  user  can  tag  all  of  the  various  objects  when  drawing  the 
particular  design  problem  to  be  studied.  However,  a  second  and  more  fundamental  view  needs  to 
also  be  generated  which  contains  features  such  as  permeability  and  permittivity.  It  is  this  second 
view  which  contains  the  fundamental  physics  information  that  will  drive  the  information  which  the 
wrapper  for  various  tools  will  search  for.  Therefore,  under  the  revised  and  more  generic  REF  data 
modeling  approach,  tools  which  perform  magnetic  analysis  will  search  for  all  entities  whose 
second  view  contain  magnetic  information. 
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•  The  object  names  are  used  to  associate  geometric  defining  lines  and  arcs  with  the  corresponding 
parametric  information  in  the  database.  It  is  proposed  that  such  a  means  of  connectivity  will 
continue  to  be  used  as  the  REF  data  modeling  continues  to  evolve. 

•  Mass  quantity  information  such  as  the  particle  and  fields  information  in  the  database  is  stored  with 
appropriate  headers  to  define  the  columns  and  rows  of  this  high  volume  information.  In  moving  to 
the  next  generation  of  REF  software,  the  flexibility  of  this  data  modeling  approach  needs  to 
enhanced  so  as  to  permit  other  high  volumes  of  information  (such  as  CEM  data)  and  it  be 
organized  in  such  a  manner  that  a  group  of  visualization  tools  can  be  easily  targeted  to  provide  a 
visualization  capability  for  this  data. 


4»5  Distributed  Computing  and  Distributed  Design 

In  the  long  term,  REF  will  provide  the  opportunity  for  design  teams  to  utilize  codes  in  distant  places  as 
well  as  to  enable  design  teams  to  work  together  effectively  even  when  the  team  members  are  in  widely 
separated  locations.  For  the  distributed  computing  scenario,  the  REF  environment  will  provide  not 
only  the  control  information  to  activate  the  particular  remote  codes  so  desired  but  to  also  manage  the 
data  flows  into  and  out  of  the  remote  codes  over  the  networks.  In  order  to  provide  for  a  distributed 
design  environment,  a  two  layered  approach  will  be  available.  A  designer  may  remotely  logon  to  a 
REF  system  and  use  the  X-windows  protocols  over  a  high  speed  line  to  perform  designs  on  the  host 
REF  system.  Alternatively,  the  designer  may  logon  to  their  own  local  REF  system  and  "check"  data 
out  of  the  main  remote  REF  data  repository,  perform  the  intended  design  work  and  then  check  back  the 
completed  design  results  to  the  main  remote  repository. 

4,6  Advanced  Gridding  Technology 

REF  is  enabling  in  the  sense  that  advanced  grid  technologies  will  be  available  in  the  environment.  An 
API  is  being  developed  such  that  grids  for  electromagnetic  analysis  and  thermomechanical  analysis  can 
be  accomplished.  Plug  and  play  gridding  tools  suitable  for  various  solution  tools  are  also  being 
studied. 

5,0  Conclusions 


The  REF  prototype  is  a  platform  from  which  a  future  advanced  design  environment  can  grow  and 
develop.  Effective  progress  for  REF  in  the  future  requires  a  strong  link  with  interested  parties  in  order 
to  capture  all  relevant  design  issues  and  to  collect  suitable  feedback  so  that  the  REF  development  can 
adjust  during  its  evolutionary  development  period.  A  users'  group  has  been  formed  to  provide  a  forum 
for  the  discussion  and  collection  of  ideas  and  directions  for  future  generations  of  REF. 
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Abstract 

We  present  a  novel  Method  of  Moments  (MOM)  program  for  general  3D  bodies  which 
is  both  simple  and  robust.  The  algorithm  is  based  on  the  Radiation  Condition  Integral 
Equations  (RCIE)  which  are  a  weak  form  of  the  radiation  condition  expressed  in  terms 
of  the  fields  on  the  surface  of  the  scatterer.  A  key  element  in  the  development  of  the 
RCIE  is  the  separation  of  the  time  harmonic  Maxwell’s  equations  into  two  decoupled 
equations  using  chiral  field  combinations.  In  addition  to  simplifying  the  development, 
this  approach  yields  a  particularly  stable  formulation  of  the  scattering  problem.  The 
resulting  integral  equations  are  discretized  using  a  very  simple  pulse  basis  function  which 
greatly  simplifies  problems  encountered  in  meshing  complex  surfaces  with  overlapping 
interpolational  bases. 


1  Introduction 

The  calculation  of  scattering  from  three  dimensional  targets  of  general  shape  is  a  subtle  and 
complex  problem.  While  many  Method  of  Moments  (MOM)  algorithms  have  been  developed 
for  3D  scatterers,  few  are  simple  to  use.  One  of  the  principle  difficulties  faced  by  the  user  of 
a  MOM  program  is  the  sheer  complexity  of  generating  a  viable  surface  mesh.  Typically,  this 
surface  mesh  is  composed  of  surface  elements  (such  as  triangles)  which  are  required  to  share 
common  nodes  on  common  boundaries.  In  addition  one  must  also  provide  a  list  of  element 
interconnections  which  define  the  allowed  current  flow  on  the  surface.  Even  for  very  simple 
geometries,  such  as  a  conesphere,  the  generation  of  these  inputs  is  a  complex  problem. 

We  present  a  MOM  algorithm  which  removes  many  of  the  constraints  and  difficulties 
associated  with  the  present  surface  current  modeling  schemes.  The  approach  requires  only 
that  the  target  surface  be  covered  with  elements,  rather  than  requiring  a  complex  (and  highly 
constrained)  interlocking  mesh  of  current  elements. 
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In  the  following  section  the  development  of  the  RCIE  is  reviewed.  To  keep  the  discussion 
brief,  only  essential  points  of  the  argument  will  be  given.  A  more  complete  treatment  of  the 
RCIE  is  to  be  found  in  Reference  [1].  The  geometry  model  and  H-field  basis  is  discussed  in 
Section  3.  In  Section  4  the  example  of  a  conesphere  is  presented  to  demonstrate  both  the 
stability  and  the  current  limitations  of  the  approach. 


2  Radiation  Condition  Integral  Equations 

We  consider  the  problem  of  scattering  from  a  body  of  finite  extent.  The  surface  of  the 
scattering  body,  5fc,  is  assumed  to  have  a  unit  normal,  n,  directed  out  of  the  volume  of 
the  body  at  every  point.  Generalizations  to  bodies  which  have  surface  singularities  (such 
as  edges,  corners  and  cusps)  is  straightforward  provided  the  surface  singularities  are  sets  of 
measure  zero  for  integration  over  Si- 

Unlike  the  Electric  (EFIE),  Magnetic  (MFIE),  or  the  Combined  (CFIE)  Field  Integral 
Equations,  the  Radiation  Condition  Integral  Equations  do  not  express  boundary  conditions 
on  the  surface  of  the  scattering  body.  Rather,  the  RCIE  are  the  radiation  condition  in  weak 
form  expressed  in  terms  of  the  total  fields  on  the  surface  of  the  scattering  body.  Since  the 
radiation  condition  is  common  to  all  scattering  problems,  the  RCIE  hold  for  all  scattering 
bodies — independently  of  any  surface  boundary  conditions.  To  obtain  a  system  of  integral 
equations  which  describe  a  particular  scattering  problem,  the  RCIE  must  be  augmented  by 
the  boundary  conditions  on  the  surface  of  the  scatterer.  Incorporating  surface  boundary 
conditions  is  straightforward  since  the  RCIE  are  expressed  in  terms  of  total  fields  on  the 
surface  of  the  scatterer. 

The  RCIE  have  their  simplest  expression  in  terms  of  the  chiral  field  combinations.  With 
an  time  dependence  Maxwell’s  time  harmonic  field  equations  take  the  form  in  a  vacuum 

VxE  =  ikB.  (1) 

VxH  =  -ikE  (2) 

where  k  =  27r/A  and  units  have  been  chosen  to  make  the  impedance  of  free  space  1.  Chiral 
field  combinations  are  defined 

=  H  +  ^E  (3) 

=  n-iE  (4) 

where  the  R  and  L  superscripts  denote  right-handed  and  left-handed  fields,  respectively.  The 
origin  of  the  labels  “right-handed”  and  “left-handed”  arises  from  the  fact  that  (4)  is  zero  for 
fields  which  are  the  sum  of  only  right-hand  circularly  polarized  plane  waves.  In  this  sense 
the  field  (3)  may  be  viewed  as  the  sum  of  all  right-hand  polarized  plane  wave  components 
of  the  field  while  (4)  is  the  sum  of  all  left-handed  components. 

To  a  large  extent  expressions  involving  either  right-handed  or  left-handed  fields  will  be  the 
same  up  to  an  occasional  sign.  To  reduce  the  number  of  expressions  we  use  the  superscript, 
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a,  to  denote  either  an  R  or  an  L.  In  expressions  for  which  the  choice  of  sign  depends  on 
handedness,  d:  and  =p  will  be  used  where  the  upper  sign  holds  for  q  =  i?  and  the  lower  sign 
holds  for  a  =  L.  Using  (1)  and  (2)  the  chiral  fields  are  seen  to  obey  two  decoupled  first 
order  differential  equations 

V  X  F“  =  ±A:F“.  (5) 

The  implications  of  (5)  are  far  reaching  and  imply  that  the  F^  and  F"^  fields  are  dynamically 
independent  exterior  to  the  scatterer.  Therefore,  in  the  far  field  the  right-handed  and  left- 
handed  fields  obey  separate  (and  independent)  radiation  conditions. 

The  chiral  fields  have  a  number  of  interesting  properties  not  shared  by  the  usual  E  and 
H-fields.  One  property  of  particular  importance  is  that 


V-(FfxFJ)  =  0  (6) 

where  Fj  and  FJ  are  any  two  solutions  of  (5)  of  the  same  handedness.  From  (6)  and  the 
divergence  theorem  it  follows  that 

X  F“)-n(/A  =  jjJ^V-{F^xF^)dV  =  Q  (7) 

where  the  boundary  surface  of  volume,  V,  is  written  dV. 

To  apply  (7)  to  scattering  problems  consider  the  volume,  V,  to  be  all  space  exterior  to  the 
scatterer.  Provided  V  is  free  of  sources,  it  may  be  shown  that  (7)  holds  with  the  boundary  of 
V  taken  as  Sb^  From  (7)  one  arrives  at  the  following  definition  for  a  weak  radiation  condition; 
Let  A"  be  the  set  of  all  solutions  to  (5)  on  V  which  obey  the  radiation  condition.  A  field 
distribution,  F,  on  Sb  satisfies  the  weak  a-handed  radiation  condition  provided 

^  (T“  X  F)  •  n  dA  =  0  (8) 

for  all  T®  6  A®.  The  fields,  A®,  are  referred  to  as  testing  fields  and  play  the  role  of  Green’s 
functions  in  the  development. 

The  weak  radiation  conditions  (8)  is  applied  to  scattering  problems  by  expressing  the 
total  chiral  fields  as  a  sum  of  the  incident  and  scattered  fields 

F“  =  +  (9) 

By  definition  the  scattered  fields  are  members  of  the  sets  A®.  Integrating  both  sides  of  (9) 
by  a  general  element  of  A®  yields  the  RCIE 

X  F“)  .  n  dA  =  Jf{T^  x  •  n  dA.  (10) 

Equations  (10)  are  linear  integral  equations  satisfied  by  the  chiral  field  distributions  on  the 
surface  of  any  scatterer.  They  are  incomplete  in  that  the  boundary  conditions  on  the  surface 
of  the  scatterer  have  not  been  specified. 
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To  incorporate  surface  boundary  conditions  into  (10)  is  straightforward.  For  perfect 
conducting  bodies  the  appropriate  surface  boundary  condition  is  Ey  =  0  on  56.  From  (3) 
and  (4) 

F«  =  Ff  =  H||  (11) 

at  every  point  on  56.  Substituting  Hy  for  the  total  chiral  fields  in  (10)  yields  the  integral 
equations 

xH)-ndA  =  x  FL«e„<) '  "  dA  (12) 

J (T^x-a)-ndA  =  J l^{T^  X  Ff„„.*„.)  •  n  dA  (13) 

which  are  solved  numerically  in  the  following  sections  for  the  tangent  H  field.  It  is  interesting 
to  note  that  the  only  coupling  between  chiral  field  components  occurs  on  the  surface  of  the 
scatterer  by  virtue  of  the  surface  boundary  condition  (11). 


3  Numerical  Algorithm 

Application  of  MOM  to  equations  (12)  and  (13)  requires  that  finite  dimensional  approxima¬ 
tions  are  made  for  the  surface  H-field  and  the  testing  fields.  Before  making  these  approx¬ 
imations  we  first  consider  the  geometric  model  or  surface  representation  problem.  Let  the 
surface,  56,  (which  is  assumed  to  be  closed)  be  represented  by  a  finite  number,  iV,  of  para¬ 
metric  patches.  A  parametric  patch  is  defined  as  a  differentiable  vector  function,  Iik{u,v)^ 
(fc  =  Ij  • .  •  j  N)  of  two  parameters  which  maps  the  region,  [0, 1]  x  [0, 1],  in  parameter  space 
onto  a  region,  Pk,  of  the  surface  Sb-  The  N  parametric  patches  are  chosen  to  cover  the 
surface  such  that  every  point  on  56  is  either  within  a  unique  patch,  Pjt,  or  on  the  common 
boundary  between  patches.  Patches,  being  non-overlapping,  permit  the  surface  integrals  in 
(12)  and  (13)  to  be  written  as  a  sum 

IjT  (T“  xll)-ndA=  'Z  -  Tk,A,u)  du  dv  (14) 

of  integrals  over  each  parametric  patch.  The  tangent  components  of  the  vector,  H,  (and 
similarly  T)  are  given  on  each  patch  by  the  projections 

ff*,„(«,j;)  =  H-^  and  n)  =  H  •  (15) 

The  pulse  basis  is  defined  by  approximating  the  functions  (15)  as  constant  over  each  patch. 

The  form  of  the  testing  fields  in  (12)  and  (13)  are  quite  general,  however,  we  will  restrict 
our  choices  to  dipole  fields  ^  for  simplicity.  A  chiral  dipole  field  is  constructed  from  the 

^In  general  all  testing  fields  may  be  expressed  as  integrals  (sums)  of  dipole  fields.  In  this  sense,  dipole 
fields  span  the  sets,  A®,  which  are  linear  vector  spaces. 
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vector  potential 


Afc  : 


~ifc|r-rfcl 


iPfc 


(16) 


47r|r-rfc| 

where  p^;  is  the  vector  dipole  moment  and  is  the  source  point  location.  In  general  the  chiral 
dipole  sources  may  be  chosen  anywhere  throughout  the  volume  of  the  scatterer.  However, 
we  have  found  the  best  performance  is  attained  by  choosing  the  testing  field  sources  on  the 
surface,  5^,  such  that  vector  dipole  moments  are  tangent  to  the  surface  at  the  location  of 
the  source. 

From  (15)  it  is  apparent  that  there  are  two  degrees  of  freedom  per  patch.  To  obtain  a 
square  system  matrix,  i.e.  to  insure  equal  numbers  of  H-field  “unknowns”  and  equations, 
one  must  use  two  chiral  dipoles  per  patch.  It  is  also  desirable  to  treat  the  right-handed  and 
left-handed  equations  on  an  equal  footing  by  selecting  one  right-handed  and  one  left-handed 
dipole  on  each  patch.  The  explicit  choices  made  in  this  work  are 


=  R;t(l/2,l/2) 
±  iCk 


(17) 

(18) 


where  the  unit  vector  ft  is  chosen  to  lie  along  the  u-tangent  vector  at  and  =  n  x 
With  this  choice  of  dipole  moments  the  radiation  pattern  of  the  dipoles  are  rotationally 
symmetric  in  the  plane  tangent  to  the  surface  at  rjt  with  a  null  along  the  inward  directed 
normal. 

For  modeling  of  general  3D  surfaces,  bicubic  [3]  parametric  patches  are  used.  Each  patch 
is  defined  by  interpolating  16  surface  points,  on  the  patch  using 


3  3 

i=0  j=Q 


(19) 


where  the  interpolation  function,  Gi{u),  are  third  order  interpolating  polynomials.  Bicubic 
parametric  patches  (19)  provide  a  very  robust  surface  model  which  is  capable  of  accurately 
modeling  curved  as  well  as  flat  surfaces.  Since  no  “connection”  is  required  with  neighboring 
patches,  the  user  need  only  specify  the  16  surface  points  for  each  patch.  The  only  restrictions 
on  the  input  data  is  that  the  patch  normal  (defined  as  the  u-tangent  vector  crossed  with  the 
u-tangent  tangent)  be  directed  out  of  the  volume  of  the  scatterer  and  that  the  patches  cover 
the  surface. 

The  resulting  2N  x  2N  matrix  equation  is  written  in  block  form  as 


(20) 


where  the  coefficients  /i“  and  are  the  constants  assumed  for  the  functions  (15).  The 
right-hand-side  vector  is  obtained  by  applying  the  divergence  theorem  to  the  right-hand-side 
of  (12)  and  (13)  with  the  assumption  that  the  incident  field  is  source  free  throughout  the 


■ 

aR,v 

^n,m 

'K  ■ 

Pn  ‘  ^/ncideni 

aL,u 

.  ■^n,m 

K 

.  Pn  ■  F/„d*„((ru)  . 
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volume  of  the  scatterer.  Matrix  elements  for  which  the  testing  field  source  is  separated  by 
some  distance  from  the  H-field  patch  {n  ^  m) 

AZ^  =  -  I'du  f'dvT:,,  (21) 

Jo  Jo 

a::i=  rduTdvT:^^  (22) 

Jo  Jo 

are  evaluated  with  guassian  quadratures.  The  “self-terms”,  for  which  the  dipole  sources  are 
on  the  patch  of  integration,  are  evaluated  by  separating  out  principle  value  and  pole  terms. 
The  pole  terms  are  the  familiar  terms  from  the  MFIE  and  take  the  form 

PZt  =  (23) 

where  the  vectors,  Dfc,  are  the  dual  vectors  to  the  patch  tangent  vectors  at  the  source  point 
location.  The  principle  value  integral  is  evaluated  by  first  subtracting  a  potential  term  of 
the  form,  where 

$  =  ^  (1  -f.  ikr  —  •  r  (24) 

4:7rkr^ 

from  the  dipole  fields.  This  reduces  the  singularity  of  the  integrand  from  l/r^  to  1/r.  The 
remaining  potential  integral  has  a  1  /r^  singularity  and  is  reduced  to  a  line  integral  around 
the  patch  boundary.  The  resulting  self- terms  are 

Al'i  =  f\u  f'dvf: 

Jo  Jo 

±  f  (^|v=0  du 

Jo 

’  ’  Jo  Jo 

±  /  (^|u=:l  —  ^|u=o)  dv 

Jo 

in  which  T"  =  T“  ±  V$.  The  remaining  singularities  in  T"  are  illiminated  by  evaluating  the 
surface  integrations  in  polar  coordinates  in  parameter  space.  While  the  choice  of  parametric 
patches  are  bicubic  surfaces,  the  integral  forms  (25)  and  (26)  are  completely  general  and  are 
valid  for  patches  of  general  curvature. 

4  Numerical  Examples 

The  generation  of  surface  meshes  for  a  conesphere  is  demonstrated  in  Figure  1.  First, 
accurate  geometric  models  of  the  scattering  target  in  terms  of  bicubic  patches  were  produced. 
These  geometric  models  are  referred  to  as  the  coarse  meshes  since  it  is  often  possible  to 
represent  curved  bodies  with  relatively  few  bicubic  patches.  The  top  two  coarse  meshes 


(25) 

(26) 
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Figure  1:  Coarse  and  fine  grain  meshes  for  conesphere 


shown  in  Figure  1  are  for  a  5  meter  long  conesphere  with  a  sphere  radius  of  0.5  meters  and 
are  comprised  of  12  and  24  patches,  respectively.  Each  bicubic  patch  is  specified  by  16  surface 
points  which  were  obtained  using  the  analytic  form  of  the  conesphere^.  Finer  grain  meshes 
were  generated  using  a  simple  subdivision  algorithm  as  follows.  Each  coarse  mesh  patch  was 
evenly  divided  in  parameter  space  into  by  subpatches.  The  number  of  subdivisions, 
Nuj  was  determined  by  multiplying  the  desired  number  of  pulses  per  unit  length  by  the  arc 
length  at  the  value  of  v  for  which  the  width  of  the  patch  is  a  maximum.  A  similar  algorithm 
is  used  to  determine  the  number,  Ay,  of  u-subdi visions.  Since  the  RCIE  algorithm  presented 
here  imposes  no  constraints  on  the  relative  position  of  element  nodes,  each  coarse  patch 
is  subdivided  independently  of  the  neighboring  coarse  patches.  This  greatly  simplifies  the 
subdivision  algorithm.  The  fine  grain  meshes  for  a  meshing  rate  of  20  pulses  per  wavelength 
at  100  Mhz  are  shown  below  the  corresponding  coarse  meshes  in  Figure  1.  Refinement  of 
the  two  coarse  meshes  lead  to  very  different  meshes.  The  refinement  of  the  12  patch  mesh 
resulted  in  a  630  patches  (Mesh  1)  while  the  refinement  of  the  24  patch  mesh  yielded  a  450 
patches  (Mesh  2).  Mesh  2  had  fewer  patches  than  Mesh  1  because  larger  patches  are  used 
at  the  cone  tip.  In  addition  Mesh  1  is  a  gridded  mesh  in  which  neighboring  patches  share 
common  nodes  which  is  not  the  case  for  Mesh  2. 

Monostatic  cross  sections  were  computed  with  the  refined  meshes  and  compared  with  a 
Body  of  Revolution  Program  based  on  the  EFIE.  The  results  of  these  calculations  are  shown 
in  Figure  2.  The  system  matrix  for  Mesh  1  was  rank  1260  and  had  an  estimated  condition 
number  of  200.  Mesh  2  resulted  in  a  rank  900  system  matrix  with  and  estimated  condition 
number  of  243.  The  maximum  difference  of  0.3dB  between  the  computeded  RCS  for  Mesh 
1  and  2  occurred  at  a  scattering  angle  of  135  degrees  in  the  theta  (horizontal)  polarization. 
This  difference  is  small  compared  to  the  4.2dB  difference  between  the  RCIE  and  EFIE  results 
at  165  degrees.  We  believe  the  EFIE  results  are  correct  since  a  convergance  study  was  made 
and  the  program  used  is  well  validated  with  range  measurements. 

Experience  with  this  and  other  scattering  targets  indicates  that,  while  the  pulse  algo¬ 
rithm  is  quite  stable  and  rapidly  convergent  for  pulse  densities  below  10  per  wavelength, 
the  convergence  for  densities  above  10  per  linear  wavelength  is  slow.  Our  current  belief  is 

^For  more  complex  surfaces,  bicubic  representations  may  be  constructed  from  IGES  data  obtained  from 
automated  drafting  programs. 


Figure  2:  Monostatic  RCS  of  conesphere  at  100  Mhz.  A  scattering  angle  of  0  degrees 
corresponds  to  illumination  of  the  cone  tip.  The  solid  curves  show  the  results  obtained  with 
an  EFIE  MOM  calculation  using  a  BOR  program.  The  dashed  and  dotted  curves  show  the 
RCIE  results  obtained  with  Mesh  1  and  Mesh  2. 

that  the  poor  convergence  for  high  pulse  densities  is  due  to  the  curl  free  nature  of  the  pulse 
basis.  The  pulse  H-field  bases  are  not  expected  to  model  surface  waves  effectively  since  these 
phenomena  involve  the  normal  electric  field  which  is  the  curl  of  the  tangent  H-field. 

A  natural  remedy  for  the  above  limitation  is  to  add  linearly  independent  basis  elements 
which  have  a  nonvanishing  curl.  We  are  currently  pursuing  this  approach  with  the  hope  of 
improving  the  convergence  rate  while  not  sacrificing  the  robustness  and  ease  of  use  of  the 
pulse  basis. 
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1  Introduction 


To  predict  reflection  of  radar  waves  from  objects  and  other  radar  radiation  patterns  it  is  necessary 
to  compute  high  frequency  electromagnetic  waves  in  arbitrary  domains.  Since  the  wavelength  of  the 
electromagnetic  waves  is  in  general  very  small  compared  to  the  size  of  the  physical  objects  of  interest, 
direct  finite  difference,  finite  element  methods  and  boundary  integral  methods  for  the  Maxwell’s  equations 
are  computationally  very  expensive  and  in  some  situations  not  practical.  Approximate  methods  have  been 
used  extensively  for  this  type  of  calculations.  We  have  developed  finite  difference  methods  for  solving 
the  nonlinear  partial  differential  equations  that  approximate  the  high  frequency  solutions  of  the  scalar 
wave  equation,  starting  from  the  geometrical  optics  approximation  and  then  continuing  to  the  classical 
asymptotic  expansion  and  a  new  perturbed  geometrical  optics  system.  These  methods  potentially  will 
replace  ray  tracing  methods  for  high  frequency  calculations  in  many  cases  and  are  applicable  directly  for 
caJculation  of  electromagnetic  waves. 

The  Maxwell’s  equations  for  electric  and  magnetic  fields  in  an  isotropic  homogeneous  material  are 
written  as: 

(e£),  =  cVx^  -4ttJ  (1) 


=  VxE 


(2) 


V-{iE)  =  Aivp  (3) 

V*(/r^)  =  0  (4) 


where  E  is  the  electric  field,  H  is  the  magnetic  field,  J  is  the  current,  p  is  the  charge  density,  c  is  the  speed 
of  light  in  the  medium;  p  and  e  are  respectively  electric  permittivity  and  magnetic  permeability.  The 
electromagnetic  waves  are  solutions  of  the  following  w'ave  equations,  derived  from  Maxwell’s  equations, 
for  El  and  vectors.  ^ 

Eu  =  -AE  (5) 

ep 

Hu  =  —AH  (6) 

tp 

Special  time  harmonic  solutions  of  Maxwell’s  equations  can  be  obtained  by  assuming  solutions  of  the 
form, 

'  E  =  H  = 

*This  research  was  supported  by  ONR  #  N00014-91-J-1034,  NSF  DMS  94-04942, 
and  ARPA  URI-ONR-N00014-92-J-1890 
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where  <l>  function  is  the  phase  of  the  wave  and  u  is  the  frequency  of  the  wave.  These  solutions  locally 
resemble  a  high  frequency  plane  wave  and  arise  in  important  practical  applications. 

By  substituting  the  above  ansatz,  one  can  show  that  the  phase  satisfies  the  eikonal  equation  [1]. 

(V0)^  =  ^  (^) 

The  eikonaJ  equation  is  a  first  order  nonlinear  PDE,  a  Hamilton- Jacobi  type  equation.  The  usual  practice 
has  been  to  solve  the  eikonal  equation  by  ray  tracing  methods.  Ray  tracing  in  this  context  is  nothing  but 
solving  the  eikonal  equation  using  the  method  of  characteristics  for  PDEs.  Our  approach  relies  on  a  direct 
discretization  of  the  eikonal  equation  using  the  recently  developed  numerical  methods  for  Hamilton- Jacobi 
equations  [4].  From  the  above  ansatz  one  also  derives  transport  equations  for  amplitude  of  the  wave.  The 
transport  equations  are  of  hyperbolic  type  and  we  solve  them  using  high  order  upwind  numerical  methods 
developed  for  hyperbolic  equations. 

To  develop  the  methods,  here  we  concentrate  on  the  scalar  linear  wave  equation  in  two  spatial  variables, 
a  scalar  version  of  the  wave  equations  for  electromagnetic  waves.  Any  method  developed  for  the  scalar 
wave  equation  is  applicable  directly  for  solving  the  electromagnetic  wave  equations. 

lift  =  c^An  =  +  Sy) 

X  and  y  are  the  spatial  variables,  t  is  time,  u  is  the  amplitude  of  the  wave  and  c(a:,y)  is  the  speed  of  the 
wave  in  the  medium.  We  consider  time  harmonic  solutions  of  the  form 


u{x^y,t)  =  e^'^^v{x,y).  (9) 

Here  cj  is  the  frequency  of  the  wave  as  imposed  by  the  boundary  conditions.  For  time  harmonic  solutions 
the  wave  equation  is  reduced  to  the  Helmholtz  equation. 

^2 

Au -I- =  Av  +  j/)i;  =  0  (10) 

n  is  the  index  of  refraction  and  the  non-dimensional  quantity  A:“  =  ^  ^  represents  relative  size  of 

the  wavelength  of  the  wave  with  respect  to  the  physical  size  of  the  problem.  The  fundamental  difficulty 
in  solving  the  Helmholtz  equation  for  large  values  of  parameter  k  is  the  fact  that  the  necessary  mesh  size 
in  each  coordinate  direction  is  proportional  to  To  resolve  each  wavelength  one  approximately  needs 
ten  points.  For  a  three  dimensional  calculation  one  needs  approximately  O((10A:)^)  number  of  points  to 
resolve  the  solution.  Values  oi  k  =  1000  are  common  for  many  applications. 

Since  direct  solutions  are  not  practical,  approximate  methods  are  of  special  interest.  A  qualitatively 
correct  approximation  is  the  geometrical  optics  approximation,  which  is  correct  in  the  limit  of  infinite 
frequency,  equivalent  to  zero  wavelength.  The  geometrical  optics  could  be  derived  formally  by  considering 
a  solution  of  the  form 

u  =  (11) 

for  the  Helmholtz  equation.  The  amplitude,  A,  and  the  phase,  <f>,  satisfy 

f  (12) 

\  Q^2VA^V4>-\-A/\4> 

This  approximation  is  valid  for  infinite  frequency  and  it  does  not  include  all  solutions  of  the  Helmholtz 
equation,  notably  diffraction  phenomena  are  not  represented  by  the  above  equations. 
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A  substantial  correction  to  the  above  approximation  could  be  achieved  by  expanding  solutions  of  the 
Helmholtz  equation  around  zero  wavelength,  in  inverse  powers  of  k,  in  the  following  form, 

oo 

u(a:,t/)  =  (13) 

n  =  0 

By  substituting  the  above  expansion  in  the  Helmholtz  equation  and  equating  the  coefficients  of  different 
powers  of  A:,  partial  differential  equations  for  the  functions  Uq,  Vj,  etc  are  derived  [3]. 

r  0  = 

I  0  =  2c2V(^-Vvo4-c2uoA^  (14) 

[  0  =  2c^V(^  • 

The  asymptotic  expansion,  since  it  is  a  correction  to  the  geometrical  optics,  also  fails  where  geometrical 
optics  fails,  notably  on  shadow  lines.  The  representation  (13)  requires  a  unique  phase  to  be  defined  and 
it  does  not  allow  for  crossing  rays. 

We  have  investigated  a  new  approximation  which  we  shall  name  perturbed  geometrical  optics  equa¬ 
tions.  This  system  is  the  geometrical  optics  equations  including  the  usually  omitted  terms  of  order  l/k~. 

I  0  =  lV^I^-n^-i-(l/k^)(AA/A) 

[  0  =  2VA- Vd>+ AA(?^  ^  ^ 

Analytical  results  suggest  that  these  equations  are  equivalent  to  the  Helmholtz  equation  in  a  compact 
simply  connected  domain  as  long  as  the  amplitude  of  the  solution,  [-ul,  is  bounded  away  from  zero. 
Numerical  results  show  that  this  system  has  qualitatively  correct  solutions  near  shadow  lines. 


2  Numerical  Methods 


We  use  finite  difference  methods  to  solve  the  nonlinear  PDEs  that  we  have  discussed.  We  use  upwind 
methods  for  discretization  due  to  the  hyperbolic  nature  of  the  equations.  To  achieve  higher  order  accuracy 
we  use  ENO  scheme  (  essentially  non-oscillatory)  for  calculating  derivatives.  The  ENO  method  is  based  on 
an  adaptive  stencil  to  calculate  the  derivative  of  a  function  defined  on  the  grid.  Use  of  ENO  method  allows 
us  to  calculate  solutions  of  the  eikonal  equation  with  discontinuous  first  derivatives  without  smoothing 
the  solution.  We  use  the  follow’ing  notation  to  denote  the  discretization.  Ax,  Ay,  and  At  are  the  mesh 
sizes.  <l>ij  is  the  numerical  approximation  to  the  solution  of  the  eikonal  equation. 

4>?j  -  =  <;6(iAx,iA2/,nA/)  (16) 

Also  we  use  standard  notation  for  forward,  backward,  and  centered  differences. 


D,  (pij  =  ■ 


= 


The  most  important  equation  of  the  three  approximations  that  we  are  considering  is  the  eikonal 
equation.  Our  numerical  algorithm  for  the  eikonal  equation  is  developed  based  on  the  recently  devised 
numerical  methods  for  Hamilton- Jacobi  type  equations.  To  solve  the  eikonal  equation  we  introduce  time 
as  an  artificial  variable. 


(18) 
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The  discretization  in  time  is  of  the  forward  Euler  type. 


The  numerical  flux,  denoted  by  G,  is  calculated  based  on  the  exact  or  an  approximate  solution  of  a 
Riemann  problem  for  the  eikonaJ  equation.  We  use  two  different  numerical  fluxes,  one  is  a  Lax- Friedrichs 
type  method  and  the  second  is  a  Godunov  type.  The  Lax- Friedrichs  flux  is  defined  as 

Gir(u+,  ii“, U-)  =  +  u-)/2,  (u+  +  v-)/2)  +  a,(u+  -  u')  a,{v^  +  u")  (20) 

Where  and  are  local  upper  bounds  for  absolute  values  of  the  partial  derivatives  of  the  Hamiltonian 
Eiu,v). 

1-^1  <  Gr  for  ue{u‘,u'^)  (21) 

In  the  second  method  the  flux  is  defined  through  a  third  order  ENO  interpolation  and  a  Godunov  type 
Riemann  solver. 


<l>ir  =  C-  +  J, 


(22) 


The  values  of  the  derivatives,  ^  are  calculated  based  on  third  order  interpolation  with  an  adaptive 

stencil  [4].  The  Godunov  flux  is  defined  by 


GG(u+,ii",u+,'y“)  =  ext„g/(y- ^+)  exti,g/(,,-_t,+)  H{u,v) 


(23) 


H{u,v)  =  cy/u^~r^  and  I(a,b)=[  min(a,b),  max(a,b)]  .  The  function  ext  is  defined  by 

ext»,g;(a,j)  =  max^,g/(a,i,)  if  a  <  6,  (24) 

exty6/(a.6)  =  min,,g/(^,6)  if  b  <  a.  (25) 

Note  that  in  general  the  operations  of  taking  max  and  min  do  not  commute  and  the  Godunov  flux  in  not 
always  uniquely  defined.  But  for  many  cases,  including  our  J¥(u,u),  the  flux  is  uniquely  defined.  Use  of 
a  third  order  ENO  interpolation  and  the  exact  Riemann  solver  results  in  the  excellent  resolution  of  the 
discontinuities  in  the  solution  of  the  phase. 

The  time  dependent  transport  equations  are  hyperbolic  equations  that  govern  conservation  of  energy 
for  the  wave.  For  solving  the  transport  equations  we  use  upwind  schemes.  Higher  order  ENO  methods 
are  easily  implemented  in  our  codes.  We  use  centered  differences  to  calculate  the  velocity  vector  field, 
V<^,  from  the  discrete  values  of  0. 

=  (26) 

The  above  values  are  used  for  an  upwind  discretization  of  the  transport  equations.  For  example  the  term 
4>x  '  IS  discretized  as  •  D^Aij  where 

if  Di4>i,j  >  0 

D^,A?j  =  if  DU?,j  <  0  (27) 

Note  that  for  a  first  order  stencil  is  just  Z)f . 

We  use  a  time  marching  scheme  to  solve  the  perturbed  geometrical  optics  system.  The  time  march¬ 
ing  corresponds  to  solving  the  Helmholtz  equation  as  steady  state  solution  of  a  linear  time  dependent 
Schroedinger  equation.  Here  we  only  consider  discretization  of  the  following  system  for  variables(t;,^), 
A  =  e\  The  discretization  for  variables  {A,(f>)  is  similar.  We  discretize  the  following  system. 
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/  <i>^  =  -  (l/k^){Av  +  I 

\  Vr  =^2VvV<P  +  A<l>  ^  ^ 

Since  we  are  interested  only  in  the  steady  state  case  we  consider  a  simple  time  discretization.  The 
linearized  system  is  of  Schroedinger  type  and  a  forward  Euler  time  discretization  of  the  system  will  be 
unstable.  We  do  a  mixed  discretization  in  time  of  the  system.  We  treat  the  linear  part  implicitly  and  the 
nonlinear  part  explicitly.  This  will  enable  us  to  have  time  steps  which  are  proportional  to  Aa:,  since  we 
are  treating  the  nonlinear  transport  terms  explicitly.  The  numerical  scheme  for  the  system  is: 


(  4>lf^  -f  (l/F)At(Z)+Z)j<ti  +  DtO-v^f^)  = 

]  -  At{D;Dt4>-+^  +  = 

[  2AtD^v^j  •  +  2AtD;;vPj  • 


This  discretization  is  stable  for  Ai  <  cAx  from  the  explicit  discretization  of  the  transport  equations.  In 
practice  we  found  out  that  we  could  use  the  above  scheme  in  the  range  10  <  fc  <  oo. 


3  Numerical  Results 

We  consider  reflection  of  an  incident  wave  olf  a  cylinder  with  radius  one.  The  source  of  the  wave  is  at 
point  (0,7).  We  use  polar  coordinates  for  calculations.  The  physical  problem  is  posed  in  the  entire 
but  our  computational  domain  is  [1,6]  x  [~7r,7r].  The  solution  is  assumed  to  be  of  the  form 

u  =  (30) 

The  geometrical  optics  equations  for  the  variables  AifA2,<f>i,  and  </>2  are  solved  and  then  the  results  are 
added  together  to  obtain  the  solution.  The  two  systems  are  coupled  through  the  boundary  conditions 
at  the  surface  of  the  cylinder.  We  specify  Dirichlet  boundary  conditions  for  phase  and  amplitude  of  the 
incident  wave  in  the  region  of  the  domain  where  it  is  illuminated  by  the  source.  Everywhere  else  the 
boundary  conditions  are  based  on  the  local  direction  of  the  characteristics.  The  phase  of  the  reflected 
wave  at  the  surface  of  the  cylinder  is  equal  to  the  incident  wave  and  therefore  at  r  =  1  we  specify  the 
boundary  conditions  for  4)2  and  A2  according  to: 

02(1,^)  =  4>l{^.0)  AnilJ)  =  4i(l,^).  (31) 

The  problem  is  solved  numerically  in  (r,  ^)  space  and  the  results  are  interpolated  to  a  Cartesian  grid.  The 

equations  for  variables  Ai^A2i4>\^  a-nd  4>2  ^re  valid  in  the  limit  of  infinite  k  but  we  have  used  =  3  to 

generate  figure  1  to  compare  our  solution  with  an  exact  solution  for  ^  —  3.  In  the  next  figure  we  present 
the  exact  solution  computed  from  the  expansion  of  the  solution  in  Bessel  functions. 

The  multivalued  solutions  of  the  eikonal  equation  in  domains  with  constant  index  of  refraction  outside 
a  convex  domain  have  a  simple  structure  which  could  be  exploited.  If  one  considers  a  boundary  fitted 
coordinate  system  where  one  of  the  coordinates  is  radially  outward  from  center  of  the  body  and  the  other 
two  coordinates  are  local  coordinates  on  the  surface  of  the  object,  then  the  incident  and  reflected  solutions 
of  the  eikonal  equation  become  unique  in  this  coordinate  system.  If  we  consider  reflection  of  a  wave  from 
a  cylinder  we  can  extend  the  domain  for  0  from  (-7r,7r)  to  the  whole  real  line.  The  multivalued  solutions 
in  this  case  are  numerated  by  integers.  For  computational  purposes  we  have  to  cut  the  domain.  We 
show  the  results  of  our  calculation  in  the  domain  (— 27r,  27r)  for  the  phase  and  log  of  the  amplitude  of  the 
reflected  wave. 
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The  last  computed  example  corresponds  to  calculation  of  the  phase  and  amplitude  around  a  shadow 
line  using  the  perturbed  geometrical  optics  system.  We  set  up  the  problem  in  the  domain  [— 1, 1]  x  [-1, 1]. 
We  specify  Dirichlet  boundary  conditions  on  one  side  and  Sommerfeld  radiation  boundary  conditions  on 
rest  of  the  domain.  Let  us  recall  that  that  the  solution  u  is 

u  =  (32) 


For  Dirichlet  boundary  conditions  we  specify 

u(-l,y)  =  0  =  if  y>0  ,^3) 

=  -10  if  y  <  0 

For  the  rest  of  the  boundary  we  specify  ■  n  and  Vu  •  n.  Since  the  values  of  ■  n  and  Vu  •  n  are 
not  known  a  priori,  we  extrapolate  the  values  from  neighboring  points.  The  obtained  amplitude  is  show'n 
in  figure  5.  Note  that  we  have  used  the  value  A:  =  100  in  this  calculation  and  that  the  amplitude  shows 
the  typical  oscillatory  behavior  around  the  shadow  line  (compare  with  [1]  p.  434  ).  In  the  last  figure  we 
compute  the  intensity  of  the  diffracted  wave  near  an  straight  edge  using  the  Fresnel  approximation  ([1] 
p.  434  ). 
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Amplitude  of  the  solution 


Figure  1:  Computed  solution  outside  a  cylinder 


Amplitude  of  the  solution 


Figure  2:  Exact  solution  outside  a  cylinder 
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Phase  of  the  reflected  wave 


Figure  3:  Multivalued  phase  of  the  reflected  wave  outside  a  cylinder 


Amplitude  of  the  reflected  wave 


Figure  4:  Multivalued  amplitude  of  the  reflected  wave  outside  a  cylinder 


AmplHixlo,  krlOO 


Figure  5:  Solution  of  the  perturbed  geometrical  optics  system  around  a  shadow  line 


Ftesn«t  diflractcn  pattam  of  a  straight  adga 


Figure  6:  Fresnel  diffraction  pattern  of  a  straight  edge 
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Toeplitz  matrices  and  block  multilevel  Toeplitz  matrices  frequently  arise  in  many  CEM  prob¬ 
lems  (antennas,  tomography,  geophysical  applications,  radar  cross  section  computations,  and  so 
on).  Usually  Toeplitz  matrices  appear  when  a  problem  under  consideration  possesses  some  sym¬ 
metries.  However,  we  present  the  results  which  clearly  show  that  block  Toeplitz  matrices  can 
underlie  efficient  iterative  solution  methods  for  problems  which  possess  no  evident  symmetries. 

Consider  the  diffraction  problem  of  an  electromagnetic  field  on  a  bounded  inhomogeneuos 
obstacle  V  with  the  complex- valued  permittivity  £(M)  and  the  permeability  //(M),  which  are 
assumed  to  be  piecewise  continuous  functions  ofM  €  V.  The  electromagnetic  field  should  satisfy: 

(1)  the  Maxwell  equations  at  the  points  where  £(M)  and  p  (M)  are  continuous 


=  —iueEj 


Mev, 


here  jo  is  an  extraneous  current  source  of  the  electromagnetic  field,  Sq,  fiQ  are  the  permittivity 
and  the  permeability  of  the  free  space; 

(2)  the  continuity  conditions  for  the  tangential  components  of  E  and  H  on  the  medium  bound¬ 
aries  including  all  discontinuity  points  of  e  and  fi; 

(3)  the  radiation  conditions  at  infinity  (for  the  scattered  field). 

Let  be  the  electromagnetic  field  (induced  by  jo)  satisfying  the  Maxwell  equations 


rot  =  icufioH^y 

rot  =  —iusoE^  -f  jo, 


M  e 


with  no  obstacle.  Then  the  scattered  fields 


E^^E--E\H^  =  H-H^ 
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satisfy  the  following  equations 


rot  +  2u;(//  —  /zo)^  =  iujfioH^  + 

rot  =  —iuJSoE^  —  i<jj{e  —  €q)E  =  —iueoE^  —  j^. 

Introducing  vector  potentials 

A®(Q)  =  j  G{Q,  P)f  {P)dVp,  A^{Q)  =  j  G[Q,  P)j^  {P)dVp, 

V  V 


where 

G{Q,  P)  =  R  =  \QPl  ko  - 

we  obtain 

E^  =  (grad  div  +  k^)  -(-  rot 

=  —rot  ^  (grad  div  -j-  k^)  A^. 

Setting 

we  arrive  at  the  following  integro- differential  equations 

E  —  E^  =  (graddiv  +  k^)  f  G{£r  —  l)EdV  +  ikoWovot  f  G{fjir  ~  l)HdVy 
H  -H°  =  -i^  rot  JG{sr  -l)EdV  +  (graddiv  +  k^)  S  Glfi,  -l)HdV. 

°  V  V 

Using  Mikhlin’s  formulas  for  the  differentiation  of  weak  singular  integrals  we  obtain 


(1) 


(2) 


graddiv  j  GFdV  =  ^  graddiv  (GF)dV  - 
V  V 


rot  J  GFdV  =  J[VG,  F]dV 

V  V 

(the  right-hand  side  integral  in  the  first  equation  should  be  understood  in  the  Cauchy  principal 
value  sense)  and  then  we  rewrite  (2)  in  the  form 

(i(£.-i)  +  i)£; 

-f (graddiv  +kl)G{er-l)EdV -ikoWoJlVG,  {fir- l)H]dV  =  E°, 

^  (3) 

-Kgraddiv +ki)G{fir-l)HdV +  i^flVG,{er-l)E]dV  =  H°. 

V  °  V 


The  vector  fields  E  and  H  satisfy  the  Maxwell  equations  so  long  as  they  solve  (3).  The  scattered 
fields  satisfy  the  radiation  conditions  thanks  to  the  properties  of  the  kernel  G.  It  remains  only  to 
verify  that  the  tangential  components  of  the  fields  are  continuous  at  all  the  medium  boundaries. 
This  property  can  be  guaranteed  so  long  as  V  is  a  union  of  a  finite  number  of  subdomains  and  £r, 
firy  E,  H  are  continuously  differentiable  functions  in  each  of  them. 


182 


A  standard  approach  to  solution  of  (3)  requires  in  general  solution  of  systems  of  linear  alge¬ 
braic  equations  with  dense  coefficient  matrices.  From  the  commonly  adopted  point  of  view  it  is 
hard  to  believe  due  to  size  cosiderations  that  it  is  possible  to  treat  the  cases  of  relatively  large 
obstacles  using  3D  volume  integral  equations.  However,  we  present  an  approach  that  allows  one 
to  cover  effectively  the  case  of  really  large  obstacles.  This  approach  is  bctsed  first  of  all  on  a  change 
of  unknown  vector  variables  in  (3)  and  then  on  a  specific  reduction  technique  producing  linear 
algebraic  systems  with  properly  parameterized  dense  coefficient  matrices.  In  fact  the  coefficient 
matrices  are  embedded  into  block  multilevel  Toeplitz  matrices.  It  allows  us  to  make  storage  re¬ 
quirements  to  be  only  linearly  dependent  on  the  coefficients  matrix  size  and  to  reduce  dramatically 
the  arithmetic  complexity  of  the  coefficient  matrix- vector  multiplication  due  to  exploitation  of  the 
FFT  technique  (see  [1]), 

Let  us  first  change  the  variables: 

E  =  (e.  -  1)E,  l)Hwo,  H°  =  -woH°. 


Omitting  for  simplicity  the  tilde  sign  we  obtain  the  following  integral  equations 

(I  +  ^ 

-Jigrsiddiv +kl)GEdV -ifkolVG,  H]dV  =  E°, 

V  V 

-  (I  +  H 

+  /(graddiv  +  kl)GHdV  -  i  /  ibo[VG,  E]dV  =  H°. 


(4) 


These  equations  are  better  to  deal  with  than  (3)  since  they  involve  integral  operators  with  kernels 
which  do  not  depend  on  £r  and  fir- 

We  now  embed  the  region  V  in  a  cuboid  11,  take  up  a  uniform  cubic  mesh  on  11  and  approximate 
K  by  a  union  14  of  properly  chosen  mesh  cells.  We  then  apply  the  Galerkin  method  with  piecewise 
constant  test  functions  supported  on  these  mesh  cells.  The  Galerkin  procedure  results  in  a  linear 
algebraic  system 

Ax  =  b 

with  a  complex  symmetric  coefficient  matrix  A.  If  V  is  equal  to  11  we  have 


A  =  D  +  T, 

where  Z)  is  a  diagonal  matrix  while  T*  is  a  block  three-level  Toeplitz  matrix.  In  the  general  case, 
when  admits  the  presentation 

A  =  DA  P'^TP,  (5) 

where  P  is  a  diagonal  matrix  of  order  n  while  T  is  a  block  three-level  Toeplitz  matrix  of  order 
m  >  n  and  P  is  an  m  x  n  matrix  with  zeroes  everywhere  except  for  a  square  n  x  n  permutation 
submatrix. 

Due  to  presentation  (5)  A  can  be  multiplied  by  a  vector  with  only  O  (n  log  n)  arithmetic  op¬ 
erations.  Such  a  multiplication  procedure  is  based  on  the  FFT  algorithm  and  on  the  relation 
between  Toeplitz  and  circulant  matrices.  The  complexity  of  conventional  iterative  algorithms  for 
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solving  dense  linear  algebraic  systems  depends  crucially  on  the  costs  of  a  matrix-vector  multipli¬ 
cation  procedure.  The  FFT-based  procedure  surely  provides  a  very  efficient  implementation  of 
each  iteration  step. 

We  present  numerical  results  for  three  sample  problems  with  dielectric  obstacles.  In  each  case 
we  considered  the  symmetric  with  respect  to  the  Z-axis  configuration  and  the  incident  plane  wave 
comings  along  the  Z-axis. 

Problem  1.  Consider  a  penetrable  two-layered  ball  with  radii  rj  =  2,  r2  =  4  and  £r  ~  S  for 
0  <  r  <  Ti  and  Cr  =  2  for  ri  <  r  <  r2;  fco  =  0.8.  Figure  1  shows  comparison  of  two  graphs  of 
the  analytical  and  the  computed  components  \Ex\  of  the  electric  field  along  the  X-axis  inside  the 
obstacle. 

Problem  2.  Consider  a  penetrable  two-layered  ball  with  radii  ri  =  0.163,  r2  =  0.314  and 

=  28  +  il62  for  0  <  r  <  ri  and  £,.  =  7  +  il2  for  ri  <  r  <  r2;  A;o  =  1  (cf.  [2]).  Figure  2  shows 
comparison  of  the  computed  and  the  analytical  components  \Ex\  along  the  X-axis.  Figure  3 
shows  comparison  of  the  computed  and  the  analytical  bistatic  cross  sections  cr/A^  for  the  0  —  0 
polarization. 

Problem  3.  Consider  a  three-layered  cube  with  the  edge  length  a  =  1  and  ko  =  I,  The 
layers  have  their  boundaries  orthogonal  to  the  Z-axis.  The  widths  of  the  layers  are  equal  to 
0.25,  0.5  and  0.25  while  their  relative  permittivities  are  equal  to  3,  9,  12,  respectively.  Figure  4 
shows  comparison  of  two  diagrams  of  the  bistatic  cross  sections  obtained  by  the  suggested 
numerical  procedure  proposed  using  two  different  mesh  sizes  (32  X  32  x  32  and  64  X  64  X  64). 

Numerical  experiments  show  that  the  approach  proposed  allows  one  to  rather  accurately  com¬ 
puted  the  scattered  electromagnetic  field  inside  strongly  inhomogeneous  and  rather  large  objects. 
They  also  demonstrate  that  piecewise  constant  vector  test  functions  are  still  functions  of  very  rea¬ 
sonable  choice.  This  conclusion  apparently  contradicts  the  conclusion  made  in  [2],  where  piecewise 
linear  functions  were  strongly  advocated  to  substitute  piecewise  constant  ones.  Another  very  im¬ 
portant  implication  from  the  numerical  experiments  is  that  the  GMRES-based  solver  requires  a 
rather  modest  number  of  iterations  to  achieve  a  prescribed  accuracy.  Hence,  the  approach  pro¬ 
posed  permits  one  to  solve  very  effectively  dense  linear  systems  with  1  to  10  millions  equations. 

Theoretically  the  above  solution  strategy  can  be  applied  even  to  the  case  when  the  permittivity 
and  the  permeability  vary  within  a  wide  margin.  However,  in  those  cases  this  strategy  may  require 
such  a  fine  mesh  that  we  cannot  afford.  We  are  interested  to  present  also  a  generalization  of  this 
solution  strategy  which  is  free  from  this  drawback.  To  this  end  we  consider  a  somewhat  limiting 
case  of  a  perfect  conducting  metal  body  covered  by  a  bounded  inhomogeneous  dielectric  layer. 

Let  V  denote  the  dielectric  layer  region  and  let  S  denote  the  metal  body  surface.  The  scattered 
fields  should  satisfy  equations  (1)  but  when  trying  to  present  the  fields  using  vector  potentials  we 
now  need  to  take  into  account  also  the  fields  from  the  electric  current  on  the  metal  surface  S: 


- (grad  div  -}- 

U£0 


hi)  Jofdv  - 


-f- 


- (grad  div  +  Atq)  f  GJ^dS 

s 

rot  J  Gj^dV, 


-rot  jcfiV-xoi  jcJ^dS 

W  5 


I 

- (grad  div  -f- 

UJfiO 
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where  and  Vm  denote  the  subregions  of  V  with  e  ^  €o  and  fx  ^  fiQ,  respectively.  The  perfect 
conductor  condition 

results  in  the  following  equation  on  S 

— (l-div+fco')  fofdV  +  — (|-div+fc^)  foj^dS 
u;6o  or  J  usq  Ot  J 

s 

+  {t,  J  rot  {Gj^)dV)  =  E°. 

Vm 

As  previously,  having  changed  the  vector  variables  and  keeping  the  same  notation  for  the  new 
variables  we  finally  obtain  the  following  system  of  integral  equations 


+  J(gr^ddiv +kl)iGE)dV  - 

-(^  +  +  J  (grad  div  +  kl)  {GH)d  V  - 

Vm 

+ 

-J  i^div  +kl){GE)dV  + 

V, 


^  y(graddiv  +kl){GJ^)dS 
i  J  fco[VG,  H]dV  =  E°, 

Vm 

i  j  ko[VG,  E]dV 

woJ[VG,J^]dS  =  H°, 
s 

—i^div+kl)  fcj^dS 

UJSq  ot  j 

ikoir,  J [VG,  H]dV  =  E°. 

Vm 


To  solve  this  system  we  apply  the  Galerkin  method  using  the  same  piecewise  constant  vector 
test  functions  to  approximate  E  and  H  and  the  Rao- Wilton- Glisson  functions  to  approximate 
on  the  metal  surface.  The  coefficient  matrix  of  the  resulting  algebraic  system  takes  the  form 


A  = 


y  c  ■ 
s 


where  V  and  S  correspond  to  the  dielectric-dielectric  influence  coefficients  and  the  metal-metal 
ones,  respectively,  while  C  accounts  for  the  coupling.  To  solve  the  algebraic  system  we  use  the 
GMRES-based  solver  with  a  specific  preconditioning  strategy. 

We  present  sample  numerical  results  demonstrating  an  agreement  of  the  computed  solution 
with  the  analytical  solution. 

Problem  4.  Consider  a  metal  ball  of  radius  ri  =  0.1  with  a  penetrable  dielectric  spherical 
layer  ri  <  r  <  r2  whith  r2  =  0.4  and  €r  =4;  fco  =  1-  Figure  5  shows  comparison  of  two 
diagrams  of  the  bistatic  cross  sections  a/X^  obtained  using  the  numerical  procedure  proposed  and 
the  analytical  solution. 
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We  observe  a  rather  well  agreement  between  the  analytical  and  the  computed  solutions.  The 
numerical  experiments  thus  justify  the  robustness  of  the  approach  in  the  case  of  highly  inhomo¬ 
geneous  objects.  In  contrast  to  [2]  we  see  that  from  the  practical  point  of  view  it  is  not  at  all 
mandatory  to  turn  to  weak  formulations  and  then  to  refuse  from  using  piecewise  constant  test 
functions.  Actually  being  simpler  than  whatever  else  functions,  the  piecewise  constant  functions 
prove  to  be  a  reliable  basis  for  the  Galerkin  method  applied  straighforwardly  to  ’’strong”  integral 
formulations. 

The  proposed  approach  seems  to  be  especially  well  suited  for  the  case  when  the  metal  body 
constitutes  a  relatively  small  inclusion  inside  a  (possibly  rather  large)  dielectric  region.  In  all  cases 
really  large  linear  algebraic  systems  with  dense  coefficient  matrices  can  be  solved  very  effectively 
by  the  proposed  block-Toeplitz-structure-based  solution  strategy. 

The  authors  gratefully  acknowledge  the  helpful  contribution  by  A.Kukk  in  this  research  and 
especially  his  assistance  in  conducting  numerical  experiments.  Special  thanks  also  goes  to  A. 
Yeremin  for  his  help  to  improve  the  presentation. 
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Figure  4:  Problem  3,  cr/A^ 


Figure  5:  Problem  4,  cr/A^ 
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Abstract 

The  frequency  domain  analysis  of  several  open  region  scattering  bodies  illuminated  by  transverse 
electric  (TE)  plane  waves  is  accomplished  using  the  two-dimensional  finite  integral  technique  (FIT) 
combined  with  the  mecisured  equation  of  invariance  (MEI).  The  resulting  FIT/MEI  solutions  are 
compared  with  the  well  known  method  of  moment  (MoM)  solutions  for  validation. 


I  Introduction 

The  finite  integral  technique  (FIT)  uses  the  integral  form  of  Maxwell’s  equations  applied  to  a  discrete 
conformal  grid.  The  scattering  structure  and  the  surrounding  region  is  subdivided  into  elements,  in  this 
case  triangles,  and  constitutive  material  parameters  are  assigned  to  each  triangle  thus  facilitating  the 
specification  of  an  inhomogeneous  body.  Then,  Maxwell’s  equations,  in  the  integral  form,  are  applied  to 
each  element  to  generate  a  sparse  matrix.  The  measured  equation  of  invariance  (MEI)  is  used  to  truncate 
the  problem  space  close  to  the  scattering  body. 

II  The  Finite  Integral  Technique 

The  finite  integral  technique  directly  applies  the  Maxwell’s  equations  in  integral  form  to  a  discrete  grid 
[1].  The  unknown  quantities  are  approximated  by  explicit  basis  functions,  and  therefore  the  technique 
is  easily  derived  and  implemented.  Also,  use  of  the  integral  form  of  Maxwell’s  equation  provides  a 
smoothing  effect  on  the  problem  solution. 

Beginning  with  Maxwell’s  equations 

V  X  if  =  -jcjeE 
V  X  E  = 

the  two-dimensional  integral  form  (no  z  variation)  may  be  easily  derived,  given  by 

J  Ef  dl  =  —jcj  J  fiHzds 

-  Hi  =  jw  j  eEt  ■  ndl  (1) 

where  is  the  transverse  electric  (TE)  field,  H^  is  the  ^-directed  magnetic  field,  and  H^^'^  and  Hi  are 
the  magnetic  fields  at  two  adjacent  node  locations  i-f-l  and  i  in  the  mesh,  respectively. 

Applying  the  appropriate  basis  functions  and  simplifying  results  in  an  equation  which,  if  applied  to 
every  node  in  the  discrete  mesh,  generates  an  edge  based  sparse  finite  integral  matrix  that  can  then  be 
solved  for  the  unknown  quantities  [1]. 
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III  The  Measured  equation  of  invariance 

The  measured  equation  of  invariance  [2,3]  technique,  applied  to  the  boundary  nodes  of  a  discrete  grid 
scheme,  enforces  the  radiation  condition  for  open  region  scattering  problems. 

Using  the  static  case  as  an  example,  assuming  we  have  an  open  region  perfect  electric  conductor 
(PEC)  with  a  charge  density  on  the  PEC,  we  can  find  the  potential  V(p)  at  any  point  in  space  using 

where  p'  and  p  represent  the  position  vectors  to  the  source  and  field  points  respectively,  and  is  the 
charge  density.  Also,  at  each  boundary  node,  an  equation  relating  the  boundary  nodes  potential  with 
the  neighboring  nodes  potentials  can  be  defined  as 

N 

=  0  (3) 

i=l 

where  Vi  is  the  unknown  voltage  at  node  i,  aj  is  an  unknown  weighting  coefficient  associated  with  Vj, 
and  N  is  the  number  of  potentials  considered. 

By  assuming  N-1  orthogonal  functions  for  the  purpose  of  representing  qy  in  (1),  and  then  solving  for 
the  Vi’s  in  (2),  a  set  of  N-1  linear  equations  with  N-1  unknown  cci’s  can  be  defined  at  each  boundary  node. 
Solving  these  equations  yields  a  set  of  weighting  coefficients  (ai;  i=l,  N-1)  associated  with  each  boundary 
node.  Using  these  weighting  coefficients  in  place  of  the  finite  difference  schemes  weighting  coefficients, 
implements  the  MEI  method  and  enforces  the  radiation  condition.  Since  N  is  a  finite  number  (in  our 
case  equal  to  6),  the  new  finite-difference  global  matrix  is  still  sparse. 

IV  Numerical  Results 

Figure  1  shows  the  normalized  current  distribution  on  the  body  of  a  0.8 A  to  a  side  infinite  PEC  square 
cjffinder  illuminated  by  a  normally  incident  TE  plane  wave  as  predicted  by  the  FIT/MEI  and  MoM.  The 
real  and  imaginary  parts  of  the  current  are  plotted  from  the  middle  of  the  shadow  region  to  the  middle 
of  the  illuminated  region  of  the  cylinder.  The  FIT/MEI  problem  space  is  truncated  2  layers  away  from 
the  cylinder  and  has  a  total  of  336  edges,  where  40  are  fixed  (known)  and  296  are  the  unknowns.  The 
MoM  cylinder  is  subdivided  into  40  equal  segments.  As  can  be  seen  in  the  figure,  agreement  between 
the  FIT/MEI  and  MoM  predicted  currents  is  good  for  both  the  real  and  imaginary  parts. 

Figure  2  shows  the  normalized  current  distribution  on  the  body  of  an  infinite  circular  cylinder  {p  = 
0.15 A)  illuminated  by  a  TE  plane  wave  as  predicted  by  the  FIT/MEI  and  MoM.  The  current  is  plotted 
from  the  middle  of  the  shadow  region  to  the  middle  of  the  illuminated  region  of  the  cylinder.  The 
FIT/MEI  problem  space  is  truncated  2  layers  away  from  the  cylinder  and  has  a  total  of  112  edges,  where 
16  are  known  and  96  are  unknown.  The  MoM  cylinder  is  subdivided  into  16  equal  segments.  As  the  figure 
illustrates,  agreement  between  the  FIT/MEI  and  MoM  is  good  for  the  imaginary  part  of  the  current  and 
fair  for  the  real  part. 

Figures  3  and  4-5  show  the  predicted  normalized  current  distributions  (electric  and  magnetic,  respec¬ 
tively)  on  the  body  of  a  0.2A  to  a  side  infinite  square  dielectric  cylinder  illuminated  by  a  normally  incident 
TE  plane  w^ave  as  predicted  by  the  FIT/MEI  and  MoM.  For  the  constitutive  material  parameters  of  the 
cylinder,  the  permittivity  (cj.)  is  4.0  and  the  permeability  {pbr)  is  1,0.  The  currents  are  plotted  from  the 
middle  of  the  shadow  region  to  the  middle  of  illuminated  region  of  the  cylinder.  The  FIT/MEI  problem 
space  is  truncated  2  layers  away  from  the  dielectric  cylinder  and  has  a  616  unknowns.  The  MoM  cylinder 
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has  112  unknowns.  In  Figure  3,  the  agreement  between  the  FIT/MEI  and  MoM  predicted  electric  current 
distribution  is  good  for  both  the  real  and  imaginary  parts.  As  Figures  4-5  show,  the  magnetic  current 
distribution  also  shows  a  fair  agreement. 


V  Conclusions 

The  purpose  of  this  effort  was  to  validate  the  FIT/MEI  method  for  open  region  problems  illuminated 
by  transverse  electric  plane  waves,  and  no  effort  was  made  to  minimize  the  unknowns  for  either  the 
FIT/MEI  or  MOM  techniques,  nor  to  maximize  agreement  between  the  two  methods  by  increasing  the 
FIT/MEI  or  MOM  unknowns.  However,  since  the  FIT/MEI  technique  generates  a  sparse  matrix,  and 
easily  handles  inhomogeneous  or  anisotropic  materials  and  complex  geometries,  it  is  possible  that  the 
technique  could  deal  with  more  complex  and  electrically  larger  problems  that  the  MOM.  Presently,  work 
is  in  progress  to  test  this  possibility. 
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Figure  1:  The  normalized  electric  current  distribution  on  the  body  of  a  0.8 A  to  a  side  infinite  square  PEC 
cylinder  illuminated  by  a  normally  incident  TE  plane  wave  as  predicted  by  the  FIT/MEI  and  MoM.  The 
real  and  imaginary  parts  of  the  current  distribution  are  plotted  from  the  middle  of  the  shadow  region  to 
the  middle  of  the  illuminated  region. 


Position  (lambda) 

Figure  2:  The  normalized  electric  current  distribution  on  the  body  of  a  p—  0-15A  infinite  circular  PEC 
cylinder  illuminated  by  an  incident  TE  plane  wave  as  predicted  by  the  FIT/MEI  and  MoM.  The  real 
and  imaginary  parts  of  the  current  distribution  are  plotted  from  the  middle  of  the  shadow  region  to  the 
middle  of  the  illuminated  region. 
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Figure  3:  The  normalized  electric  current  distribution  on  the  body  of  a  0.2A  to  a  side  infinite  dielectric 
square  cylinder  (/ir=l-0  ,  tr=4.0)  illuminated  by  a  normally  incident  TE  plane  wave  as  predicted  by  the 
FIT/MEI  and  MoM.  The  real  and  imaginary  parts  of  the  current  are  plotted  from  the  middle  of  the 
shadow  region  to  the  middle  of  the  illuminated  region. 


Figure  4:  The  real  part  of  the  normalized  magnetic  current  distribution  on  the  body  of  a  0.2A  to  a  side 
infinite  dielectric  square  cylinder  (constitutive  parameters  ,  €r=4.0)  illuminated  by  a  normally 

incident  TE  plane  wave  as  predicted  by  the  FIT/MEI  and  MoM.  The  current  is  plotted  from  the  middle 
of  the  shadow  region  to  the  middle  of  the  illuminated  region. 
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Figure  5:  The  imaginary  part  of  the  normalized  magnetic  current  distribution  on  the  body  of  a  0.2A 
to  a  side  infinite  dielectric  square  cylinder  (constitutive  parameters  ,  er=4.0)  illuminated  by  a 

normally  incident  TE  plane  wave  as  predicted  by  the  FIT/MEI  and  MoM.  The  current  is  plotted  from 
the  middle  of  the  shadow  region  to  the  middle  of  the  illuminated  region. 
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IN  TRO  D  U  CTIO  N 

The  parabolic  wave  equation  proposed  by  Leontovich  and  Fock  in  the  middle  forties  has  many 
applications  to  diffraction  and  wave  propagation  [1].  It  can  serve  as  a  powerful  computational 
tool  for  complicated  environments  with  non-separable  variables  (  radio  propagation  over  a  curved 
terrain,  in  a  nonuniform  tropospheric  duct  etc.).  A  specific  difficulty  arises  when  the  wave  field  is 
to  be  found  in  an  infinite  domain.  In  order  to  handle  the  problem  numerically  one  has  to  reduce 
the  region  to  a  finite  strip  (in  two  dimensions)  or  to  a  cylinder  (in  ZD  case)  and  to  transfer  the 
radiation  condition  posed  at  infinity  onto  its  border  (surface).  It  is  desirable  to  do  that  exactly  m 
order  to  avoid  spurious  reflections  from  the  artificially  introduced  boundaries.  An  exact  form  of 
the  boundary  condition  providing  full  transparency  for  an  arbitrary  solution  of  the  parabolic  wave 
equation  in  two  dimensions  has  been  found  and  tested  by  Baskakov  and  Popov  [2].  An  equivalent 
formula  has  been  published  independently  by  Marcus  [3].  Here,  we  discuss  this  result  and  make 
necessary  generalizations  for  three-dimensional  problems  and  quasi-stratified  environments. 

DERIVATION  OF  TRANSPARENT  BOUNDARY  CONDITION  IN  2D 

The  simplest  case  we  consider  here  is  two-dimensional  diffraction  by  a  finite  dielectric  obstacle  m 
the  half-space  a:  >  0.  If  the  dielectric  permittivity  e(x,  z)  differs  not  much  from  unity:  |  e  -  1 1  «  1, 
a  paraxial  wave  beam  E  —  u{x^z)e*^^  can  be  described  by  the  parabolic  equation 

+  +  =  0  (1) 

OX  oz^ 

Assuming  that  variation  e-  1  vanishes  above  the  level  2  =  a  we  can  solve  Eq.{l)  in  the  quadrant 
I  >  0,  2  >  a  analytically.  So,  for  a  given  initial  value  u(0, 2)  =  Uo{z)  and  for  an  arbitrary  boundary 
value  u{x,a),  the  exact  solution  of  Eq.(l)  that  satisfies  the  radiation  condition  can  be  written  m 
the  following  form:  u  —  v  w  where 

v{x,  2)  =  j Ua{z  +  t)  exp  (^)  *  -  J“o{2a  -  z-\-t)  exp  j  dt  (2) 

M  —  Z  '  ' 


and 


w{x,  z)  =  - 


d 

y/ 2’Kk  dz 


dC 

Vx  -  i 


(3) 


Now  we  can  calculate  the  normal  derivative  du/dz(x,a).  Differentiating  Eqs.(2)-(3)  and  making 
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use  of  Eq.(l)  yield  the  following  relation 


0 

It  can  be  considered  as  a  nonlocal  generalization  of  the  third-kind  boundary  condition  which  enables 
one  to  integrate  Eq.(l)  numerically  within  the  strip  -a  <  z  <  a.  A  finite- difference  approximation 
of  Eq.(4)  has  been  examined  by  Baskakov  and  Popov  [2].  Numerical  tests  showed  full  absorption 
of  arbitrary  wave  packets  encountering  the  interface  z  =  a  from  inside  of  the  strip.  If  the  initial 
function  uq[z)  vanishes  for  z  '>  a,  the  boundary  condition  (4)  takes  a  homogeneous  form 

=  (5) 

that  ensues  also  from  factorization  of  the  differential  operator  (1); 


<9  \ 


THREE-DIMENSIONAL  CASE 

Consider  a  ZD  generalization  of  the  parabolic  wave  equation 

2ik^  4-  A±u  -f-  k^{e  -  l)u  —  0  (7) 

ox 

assuming  that  variations  e  -  1  and  the  initial  value  u(0,  r,  (^)  =  uo{r,ip)  are  localized  in  a  vicinity 
r  <  a  of  the  longitudinal  axis  (a  good  example  is  a  paraxial  wave  beam  truncated  by  a  finite 
aperture  and  propagating  through  a  dielectric  quasioptic  system).  As  the  parabolic  equation  (7) 
can  be  solved  analytically  for  r  >  a,  we  are  able  to  transfer  the  radiation  condition  from  infinity  to 
the  cylindrical  boundary  r  ~  a. 

Using  the  Fourier  expansion 

oo 

u{x,r^ip)=  (8) 

m=— oo 

and  Duhamel  principle,  one  gets  an  exact  expression 


uix,r,^)  =  ^£j  Ju{i,a,i>)did^  Y:  (9) 

0  0  m— -oo 

of  the  solution  u(ar,  r^(p)  outside  the  cylinder  r  =  a  in  terms  of  its  boundary  values  u(a:,a,^).  Here, 


1  2ikp )  ^ 

J  P 


are  partial  step  functions  satisfying  the  following  boundary  condition 


Um{x,a)  = 


1  for  X  >  0 

0  for  X  <  0 
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and  the  radiation  condition  for  r  -►  oo .  Simple  calculation  yields  a  perfectly  absorbing  boundary 

condition:  . 

"1  ^  ^  ^ 

^{x,a,<p)  =  4>)did->l>  (12) 

0  0 

with  ,  .  _ 

fl  V  f  e-  ^  (13) 

that  enables  one  to  reduce  the  computation  region  to  the  semicylinder  i  >  0,  r  <  a. 

SHORT-WAVE  ASYMPTOTIC 

The  kernel  of  the  surface  integral  operator  (12)  can  be  simplified  in  the  short-wave  limit  when 
the  radius  of  the  artificial  cylindrical  boundary  a  is  large  compared  with  the  wavelength  A  =  25r/fc. 
Assuming  that  i!:a  »  1,  replace  the  Hankel  functions  in  (13)  with  their  Debye  asymptotics 


I  'KTntanp 


i  [m  (t  o  n/? — /3 — 7r/4 )] 


/fW'(0~  isin/3/fW(f) 

where  cos  j3  =  mlt.  Then  the  integral  can  be  evaluated  analytically: 


'p  J 


2ikpa^  y/ p 


oo  y 

=  y/  TT  J  exp  I 


.  \  dt  m  27r 

^2ka^  /  a  V  ^ 


2i  /  da 


the  kernel  (13)  can  be  expressed  in  terms  of  Fresnel  integrals: 


K{x,ip)  ; 


where  ~  This  approximation  makes  the  boundary  condition  (12)  ready  for  an  immediate 

numerical  implementation. 

TRANSPARENT  BOUNDARIES  FOR  QUASI-STRATIFIED  ENVIRONMENTS 

A  parabolic  wave  equation  for  UHF  propagation  in  a  tropospheric  duct  has  been  introduced  by 
Fock  [1] 

2ik^  +  ^  +  2k^-^[l+g{x,z)]u  =  0  (18) 

Here,  the  height- dependent  term  zjR*  describes  the  joint  influence  of  the  earth  sphericity  and 
barometric  refraction  (/?*  is  the  equivalent  earth’s  radius),  and  the  correction  g{Xy  z)  corresponds  to 
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nonuniform  variations  of  the  refraction  index  localized  below  some  reference  height  z  =  h  (similar 
models  comprising  a  horizontally  stratified  background  and  localized  perturbations  are  used  in 
ocean  acoustics,  Tappert  [4]).  Having  in  mind  numerical  integration  of  Eq.(18),  we  have  derived 
a  boundary  condition  reducing  the  problem  posed  in  the  half-space  z  >  0  to  a  strip  0  <  z  <  b. 
Omitting  details,  we  just  write  down  the  result  in  dimensionless  variables  ^  =  {kl2R*^y^^x,  (  = 
13  =  {2P/R*f\ 


twi{t  -  /3) 


and  u;i(^)  is  first-kind  Airy-Fock  function.  It  is  evident  that  similar  transparent  boundary  condi¬ 
tions  can  be  obtained  for  other  height-dependent  backgrounds. 

ASYMPTOTIC  BEHAVIOR  OF  BOUNDARY  OPERATORS 

Our  approach  to  truncation  of  the  computational  domain  yields  a  family  of  special  functions 
being  the  kernels  of  integral  boundary  operators.  The  simplest  2D  example  (4)  shows  that  the 
kernel  has  a  weak  singularity  at  the  end  of  the  integration  path  and  some  non-local  "tail”.  It  is 
interesting  to  examine  the  less  trivial  examples  given  above  from  this  point  of  view. 

Consider  the  3D  parabolic  equation  (7).  In  the  most  interesting  axial  symmetric  case,  the 
boundary  condition  (12)  takes  the  form 

= J  y'{^,a)Koix- (21) 

0 

with 

-  ^  VP 

In  can  be  seen  that  its  asymptotic  behavior  depends  on  the  magnitude  of  the  Fresnel  parameter 
q  —  xl2kP.  For  ^  <  1,  substituting  the  asymptotics  Hy\z)  ^  ^2/7rz  for  the  Hankel 

function,  one  gets 

Koix)  ~  -e-Vy2A:/7rz  (23) 

We  see  that  the  boundary  operator  (17)  has  the  same  singularity  as  its  2D  prototype  (4)  and 
reduces  to  the  latter  for  large  radii  a.  The  opposite  limiting  case  x  — >  oo  can  be  examined  by 
replacing  Hq‘\z)  with  its  approximation  2/7rlogz  for  small  2:.  It  makes  possible  to  evaluate  the 
integral  (18)  asymptotically: 

Ko(x)  ~  + 

X  00  J  log  (aV 2kt )  -  i7r/4 


G  V  log(uV' 2kt )  9-»-oo  a  log 

This  formula  shows  that  in  three  dimensions  the  kernel  tail  descends  more  slowly  than  l/Vx, 
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A  similar  analysis  for  the  PWE  (18)  in  a  quasi-stratified  medium  reveals  the  same  type  of 
singularity: 

K+(f\-  ^  (25) 

J  "  tw,{t  ~  ^) 

—oo~ic 

and  a  ’’tail”,  for  large  C, 


K+  _  e’^«  y  gxp(-*4e  ‘^)  ^  4,  0(e-^« )  (26) 

tending  exponentially  to  a  constant  limit. 

This  constant  does  not  give  nonlocal  contribution  to  the  integral  boundary  operator,  therefore, 
in  this  case  the  transparent  boundary  has  extremely  short  ’’memory”.  It  is  natural  because  the  rays 
corresponding  to  the  PWE  (18)  bend  upwards  and  leave  the  upper  boundary  z  =  b  rapidly. 

On  the  contrary,  if  we  write  down  an  analogous  boundary  condition  providing  perfect  trans¬ 
parency  of  an  arbitrary  chosen  ’’bottom”  of  the  nonuniform  layer  a  <  z  <  b ,  we  will  see  that  the 
boundary  operator  has  very  long  ’’memory”.  The  corresponding  kernel 


oo-:c  ..  ^ 

1  f  :.c  V  it  -  a) 
®  tv{t  -  a) 


tv{i  -  a)  4-^0  y TT^ 


by  substitution 


V  W2  VW2 


v{t)  and  W2it)  =  UJi(0  -  2iv{t)  being  Airy-Fock  functions,  can  be  reduced  to  the  following  form 


/  /  \  1  ooe--  ' 

M>2(-Q^)  I  J_  f 

W2{-Ct)  StTZ  j 


tv{t  -  a)w2{t  -  a) 


with  a  =  This  integral  is  closely  related  to  the  special  function  investigated  by 

Buldyrev  and  Lanin  [5]  in  the  problem  of  the  whispering  gallery  wave  propagation. 

WIDE-ANGLE  PARABOLIC  EQUATIONS 

Like  any  asymptotic  approximation,  the  parabolic  wave  equation  has  its  applicability  limits. 
First,  it  does  not  hold  for  the  wave  packets  with  wide  angular  spectrum.  Secondly,  at  great  dis¬ 
tances,  accumulation  of  phase  approximation  errors  takes  place  which  distorts  the  calculated  field 
distribution.  In  order  to  overcome  these  limitations,  a  number  of  modified  equations  to  replace  the 
original  PWE  has  been  put  forward.  Here,  the  wide-angle  Claerbout  approximation  [6], 


2ik[\^ 


I  d^\du 


+  —  0 


4/^2  dz'^  j  dx  dz^ 


the  fourth-order  long-range  PWE  [7] 


2^fc^+  l- 


I  9^  \  d^u 


dz^  /  dz^ 


and  the  exact  correction  formula  [8] 


X  y  ,k  ,  \.,P  dt 

0 
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(32) 


relating  the  PWE  solutions  u(x^z)  with  the  rigorous  wave  field  E{x,z)  satisfying  the  one-way 
Helmholtz  equation 

=  +  ^  „  (33) 

can  be  mentioned. 

All  these  equations  require  some  boundary  conditions  eliminating  spurious  reflections  when 
being  solved  numerically  in  a  truncated  spatial  domain.  Using  the  technique  describe  above,  they 
can  be  derived  in  a  similar  way.  As  an  example,  consider  the  wide-angle  PWE  (30)  in  the  quadrant 
X  >  0,  z  >  a.  Its  general  solution  satisfying  the  initial  condition  i£(0,  z)  —  0  and  radiation  condition 
by  z  ^  oo  can  be  expressed  in  terms  of  its  boundary  values  u(a;,a): 


=  -^  J  U{^,a)U{x  -  ^,z)d^ 


^  c+ioc 

U{x,z)  =  /  exp  px  -f-  (a 

ZTTZ  J 

c— too  L 


2kp  dp 


is  the  step  function  -  cf.  (10)  . 

Evaluating  the  normal  derivative  yields 


Kix)  =  ^  7“  e-  ^  ^  =  2^k,^^Mkx)  (37) 

Here,  Jo{t)  is  Bessel  function.  Note  that  the  nonlocal  boundary  conditions  of  transparency  are 
constructed  in  a  standard  way.  What  depends  on  the  specific  problem  is  the  possibility  of  reducing 
the  kernel’s  integral  representation  to  a  simple  closed-form  expression, 

APPLICATIONS  AND  NUMERICAL  EXAMPLES 

The  boundary  conditions  of  transparency  described  above  admit  effective  numerical  implemen¬ 
tation  in  the  framework  of  the  finite  difference  method.  They  have  been  used  in  a  number  of 
diffraction  and  wave  propagation  problems.  Among  them  are:  radio  propagation  over  an  irregular 
terrain  or  in  a  tropospheric  wave  duct,  microwave  and  JA-ray  quasioptical  systems.  We  demonstrate 
some  examples  of  field  calculation  using  transparent  boundaries  to  eliminate  spurious  reflections, 

CONCLUSION 


The  objective  of  this  paper  was  to  find  an  exact  form  of  boundary  conditions  providing  full 
absorption  of  arbitrary  paraxial  wave  packets  described  by  the  parabolic  equation.  It  would  enable 
one  to  reduce  rigorously  an  infinite  domain  to  a  strip  or  cylinder  for  numerical  calculations.  We 
proposed  a  general  approach  to  this  problem  and  demonstrated  some  practical  examples. 
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1  Statistical  Electromagnetics  (STEM)  Overview 

Interest  in  the  statistical  properties  and  treatment  of  electromagnetic  (EM)  fields  and 
their  interactions  has  begun  to  increase  in  recent  years  [1] .  One  cause  for  this  increase 
is  the  myriad  failures  of  traditional  methods  to  interpret  and  make  useful  predictions 
from  measurement  data  which,  in  turn,  has  created  an  awareness  that  new  approaches 
to  RF/microwave  experiment  design  are  now  both  appropriate  and  necessary.  The 
need  to  better  define  data  requirements  and  to  simultaneously  reduce  the  number  of 
samples/measurements  in  future  experimental  programs,  strongly  driven  by  economic 
constraints,  gives  additional  incentive  to  pursue  physics-based  statistical  modeling 
techniques  for  high-precision  experimental  characterizations. 

Another  cause  for  greater  interest  in  STEM  is  the  extremely  limited  ability,  in 
actual  practice,  to  directly  solve  Maxwell’s  equations,  subject  to  given  boundary  con¬ 
ditions,  for  problems  of  practical  interest  (even  of  only  modest  complexity).  Recent 
work  has  demonstrated  the  complicated,  chaotic  nature  of  the  EM  response  of  simple 
shapes  where  the  boundary  conditions  render  Maxwell’s  equations  non- integr able  [2] 
-[9], 

Furthermore,  even  in  a  great  many  cases  where  deterministic  solutions  are  straight¬ 
forward,  the  required  computer  CPU  time  to  achieve  a  solution  precludes  such  an 
approach.  For  example,  consider  upset  analysis  of  a  circuit  comprising  250  gates,  so 
the  number  of  system  states  is  2^^°  =  1.81  x  lO’^®.  Simply  taking  each  state  as  an 
unknowm  for  numerical  solution,  at  one  operation  per  unknown  and  with  a  teraflop 
computer,  the  time  requirement  for  one  full  numerical  characterization  is  5.7  x  10®^ 
years.  Practical  overmoded  mode-stirred  chamber  (MSC)  analysis  and  a  multitude 
of  other  problems  dramatically  illustrate  this  same  point. 

Yet  another  contributor  to  interest  in  STEM  is  an  unsubstantiated,  but  rather 
widespread  opinion  that  the  fine-grid  point  calculation  of  EM  fields  in  space  and  time 
may  be  unnecessary  to  reach  certain  final  conclusions  within  some  macroscopic  ‘per¬ 
formance  envelope’  sense.  While  such  possibilities  are  now  merely  conjecture,  they 
do  stimulate  speculation  that  future  STEM  research  might  address  such  questions. 
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One  more  impetus  for  STEM  is  that  we  must,  of  necessity,  muddle  through  life 
making  choices  and  decisions  about  EM  interactions  based  on  incomplete  and  flawed 
information.  While  most  of  us  learn  to  live  comfortably  with  some  level  of  uncertainty, 
there  is  always  a  common  desire  to  quantify  the  uncertainty,  and  to  make  it  precise. 
It  follows  from  the  ability  to  quantify  uncertainty  that  applied  EM  practitioners 
can  start  to  make  categorical  statements  with  complete  assurance  about  their  level  of 
uncertainty.  Thus  the  preeminent  objective  in  the  development  of  STEM  techniques  is 
to  be  able  to  extract  useful,  quantitative  conclusions  for  decision  makers  and  designers 
from  incomplete  and  imperfect  information,  either  in  data  or  in  the  specification  of  a 
problem  environment;  this  significantly  reduces  their  programmatic  risk  and  enhances 
confidence  in  their  decisions. 

Statistical  electromagnetics  can  be  used  to  resolve  problems  in  many  diverse  ap¬ 
plication  areas.  However,  more  basic  research  must  be  concluded  before  practical 
applications  will  yield  to  these  new  techniques.  It  is  anticipated  that  useful  methods 
for  high  frequencies  can  be  developed  in  the  reasonably  near  future,  while  extensions 
to  lower  frequency  ranges  will  require  additional  time. 

So  far,  most  endeavors  to  describe  EM  coupling  and  interactions  statistically  have 
relied  on  collections  of  measured  data,  have  employed  empirical  (best-fit)  statistics, 
and/or  have  relied  on  expert  engineering  judgment.  The  empirical,  or  curve  fitting, 
approach  generally  allows  significant  variation  between  predictions  because  one  is 
working  with  limited,  imperfect  data  sets.  Without  a  priori  knowledge  about  the 
appropriate  distribution  function(s),  the  success  of  empirical  data-fitting  has  been 
likewise  variable.  Even  the  best  among  these  studies  are  subject  to  persistent  and  le¬ 
gitimate  questions  regarding  range  of  validity.  Limited  data  tends  to  be  concentrated 
in  the  vicinity  of  the  mean,  so  that  a  good  fit  may  be  suggested  for  several  distribution 
functions.  The  range  of  validity  can  only  be  extended  by  taking  more  measurements, 
in  order  to  acquire  sufficient  data  distant  from  the  mean.  However,  the  number  of 
measurements  required  to  confidently  establish  the  distribution  is  usually  precluded 
by  time  or  budget  limitations. 

Another  related  and  simultaneous  objective  for  STEM  is  to  develop  techniques 
for  the  meaningful  analysis  of  systems,  or  classes  of  problems,  which  undergo  small 
changes  or  variations.  These  problems  are  distinguished  from  those  of  ‘fixed’  sys¬ 
tems,  where  the  principal  limitations  involve  non-integrability  coupled  with  incom¬ 
plete  and  inaccurate  environment  specification,  computer  speeds  and  storage  capa¬ 
bilities,  and  measurement  noise/errors.  In  many  important  practical  problems,  small 
system  changes  or  perturbations  are  observed  to  produce  large  variations  in  observed 
EM  response(s) .  While  these  types  of  problems  are  highly  resistant  to  treatment  by 
deterministic  methods,  it  is  believed  that  they  are  amenable  to  the  application  of 
physics-based  statistical  methods. 

Because  STEM  models  are  physics-based  and  begin  with  a  deterministic  model, 
the  predictions  must  not  violate  known  laws  of  physics  and  should  be  verifiable  by 
experiment.  Physics-based  statistical  models  can  be  progressively  improved  by  both 
improving  the  underlying  physics  and  substantiating  the  truth  of  assumptions;  theory, 
simulation  by  means  of  computational  electromagnetics  (OEM),  and  experiment  are 
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symbiotic  partners  in  the  refinement  process. 


2  Motivation  for  STEM  Development 

Understanding  the  electromagnetic  (EM)  response  of  complex  systems  is  a  key  risk 
minimization  activity  for  many  programs.  Commercial  ventures,  such  as  aircraft 
and  automotive  manufacturing,  face  these  difficulties  as  well  as  DoD,  NASA,  and 
other  government  agencies.  Confidence  in  the  ability  to  explain,  and  predict,  how 
complicated  systems  respond  to  EM  radiation  facilitates  sound  program  management. 
Useful  conclusions  for  decision  makers,  system  planners,  and  design  engineers  can  be 
extracted  by  the  application  of  statistical  techniques  to  this  important  class  of  applied 
EM  problems. 

Statistical  methods  are  justified  by  two  major  difficulties.  First,  the  non-integrable 
nature  of  many  EM  response  problems  renders  analytic  solutions  intractable;  the 
chaotic  nature  of  the  resulting  eigenfunctions,  eigenfrequencies,  and  EM  energy  flow 
(voltages  and  currents)  is  best  described  statistically.  Second,  statistical  techniques 
are  also  effectively  applicable  to  system  susceptibility  analysis  when  examining  the 
effect  of  perturbations  (topology,  measurement  location,  source  parameters,  etc.)  and 
incomplete  target /source  configuration  information  (which  may  cause  large  EM  re¬ 
sponse  changes) . 

The  acronym  STEM  has  been  coined  from  STatistical  ElectroMagnetics;  STEM 
embraces  the  study  of  non-integrability  and  physics-based  statistics  as  applied  to 
EM  response  problems.  STEM  applications  include  problems  associated  with  large 
complex  system  susceptibility,  mode-stirred  chamber  validity  and  interpretation,  de¬ 
velopment  of  calibrated  measurement  techniques,  pre-test  prediction,  far-field  EM 
scattering,  target  recognition,  and  problems  in  other  areas  of  science  and  engineering 
such  as  space  science  and  geophysics. 

Some  aspects  of  a  research  initiation  program  for  the  orderly  discovery,  devel¬ 
opment,  and  exploitation  of  STEM  are  reported  here.  Problems  areas  of  immediate 
interest  include  EM  measurement  and  testing,  fundamental  EM  interactions  and  cou¬ 
pling,  and  programmatic  risk  reduction. 

3  STEM  Preliminaries 

A  comprehensive  application  of  statistics  to  a  given  complex  EM  problem  ultimately 
must  include  a  system  model,  a  statistical  model,  and  the  application  of  probability 
theory  to  predict  system  responses.  In  the  near  future,  it  is  expected  that  exploration 
and  development  efforts  will  emphasize  physics- based  statistical  models,  namely,  the 
joint  density  functions  /(JAi,  X2, . . . ,  Xn)  associated  with  random  variables  X^.  The 
probabilistic  modeling  process  presumes  that  we  can  identify  distinct  variables  to 
be  treated  statistically,  that  a  valid  physics-based  model  can  be  constructed,  that 
modeling  should  be  in  terms  of  measurable  parameters  and,  finally,  that  it  should  be 
possible  to  demonstrate  by  experiment  the  conditions  for,  and  ranges  of  validity  for 
all  assumptions. 
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To  the  extent  possible,  a  program  which  develops  methodology  and  results  in 
STEM  corresponding  to  those  in  statistical  mechanics  (both  classical  and  quantum) 
is  appealing.  While  statistical  electromagnetics  has  some  obvious  differences  which 
must  be  accommodated  (such  as  spatial  correlations,  boundary  conditions,  and  ‘ap¬ 
propriate’  functions),  there  are  also  attractive  similarities.  In  principle,  it  is  possible 
to  deterministically  arrive  at  all  the  details  of  an  electromagnetics  problem  from  solv¬ 
ing  Maxwell’s  equations  subject  to  the  relevant  boundary  conditions,  clearly  analo¬ 
gous  to  the  detailed  dynamics  encountered  in  integrable  and  non-integrable  classical 
and  quantum  mechanics.  Furthermore,  profound  successes  with  statistical  mechanics 
inspire  the  hope  that  corresponding  powerful  principles  also  are  valid,  in  some  dual 
form,  for  electromagnetics.  For  example,  applied  EM  practitioners  view  with  great 
interest  (and  envy)  the  statistical  mechanics  result  that  the  velocity  distribution  of 
molecules  in  a  container  is  Maxwellian,  regardless  of  the  container’s  shape.  Of  course, 
we  realize  this  is  not  true  except  in  the  case  of  blackbody  radiation.  Most,  if  not 
all,  practical  EM  problems  involve  specular  reflections  or  refractions  at  boundaries, 
hence  the  failure  of  thermal  equilibrium  thermo d5Tiamics  for  the  useful  EM  case. 

In  the  case  of  applied  EM,  most  problems  of  practical  interest  have  complex 
geometrical  and  material  environments,  with  highly  intricate  boundaries.  This  non- 
integrability  creates  a  fundamental  inability  to  completely  and  deterministically  spec¬ 
ify  eigenfunctions,  eigenfrequencies,  and  the  associated  EM  energy  flow.  Because  sta¬ 
tistical  modeling  has  been  so  effective  with  related  conceptual  problems  in  statistical 
mechanics,  there  is  reasonable  cause  for  optimism  that  it  can  be  a  valuable  tool  in 
applied  EM  as  well. 

The  developers  of  statistical  mechanics  accomplished  their  feats  of  mathematical 
legerdemain  principally  by  exploring  the  relationships  between  dynamical  invariants 
of  appropriate  functions.  That  is,  the  final  statistical  theory  was  achieved  after  exten¬ 
sive  theoretical  work  on  determination  of  the  properties  of  distribution  functions.  A 
developmental  approach  for  applied  EM  comprising  parallel  theoretical,  experimental, 
and  numerical/simulation  efforts  will  be  necessary  to  characterize  the  relationships 
and  behavior  of  the  important  corresponding  functions  of  electromagnetic  variables. 

Different  ‘appropriate’  functions  apply  to  STEM;  these  functions  should  repre¬ 
sent  measurable  quantities  such  as  the  actions  associated  with  the  E  and  H  field 
components,  as  well  as  the  energy  flow  (Poynting  vector).  Also,  because  the  fields  at 
different  locations  and/or  different  times  are  correlated,  spatial  and  temporal  correla¬ 
tion  functions  are  of  special  interest  in  the  EM  case.  It  is  appropriate  that  early  work 
in  the  discovery  phase  of  STEM  should  emphasize  the  development  of  relationships 
between  these  quantities.  The  motivation  for  this  is  the  fact  that  such  relationships 
exist  and  proved  vital  for  the  codification  of  statistical  mechanics  as  we  know  it  today. 

Because  eventual  statistical  model  predictions  are  rigorously  required  to  obey 
physical  laws,  any  a  priori  distribution  functions  and  their  conditions/ranges  for 
validity  should  be  developed  from  first  principles  of  physics,  and  based  solidly  on 
Maxwell’s  equations.  While  there  is  widespread  belief  at  this  time  that  there  are 
classes  of  EM  problems  which  share  common  distributions  obtainable  from  physics- 
based  statistics,  that  view  has  not  been  adequately  substantiated  by  theory,  even 
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though  experiments  and  computer  simulations  suggest  these  relationships  exist. 

4  Frequency  Regimes 

STEM  concepts  can  be  most  directly  and  quickly  applied  at  high  frequencies,  where 
wavelengths  become  small  in  comparison  to  component  and/or  system  dimensions.  At 
such  relatively  high  frequencies,  the  number  of  modes  is  large  and  problems  involving 
cable  bundles,  multiple  EM  entry  ports,  etc.,  clearly  become  stochastic  in  nature  as 
the  problem  becomes  non-integrable. 

In  contrast,  deterministic  methods  become  more  powerful  at  lower  frequencies, 
where  few  modes  exist.  Deterministic  solutions  are  directly  attainable  for  integrable 
shapes  at  low  frequencies,  and  that  this  regime  generally  is  viewed  as  rather  simple. 
Nonintegrable  shapes  at  low  frequencies  are  not  as  well  determined  analytically,  but 
numerical  methods  and/or  experimental  characterizations  are  straightforward. 

Intermediate  frequencies,  associated  with  ‘several’  modes,  are  generally  intractable. 
It  appears  most  likely  that  the  successful  methods  for  analysis  and  prediction  in  this 
frequency  range  will  be  a  hybrid  of  theory,  simulation,  and  experiments,  but  it  is 
difficult  to  foresee  the  ultimate  details  of  such  methods. 

As  a  result,  it  is  now  anticipated  that  STEM  developmental  projects  will  first 
concentrate  on  the  high-frequency  regime.  Extension  of  results  to  progressively  lower 
frequencies,  incorporating  appropriate  modifications  to  the  theory  and  results,  will 
follow  only  after  the  high-frequency  results  are  promulgated  and  achieve  reasonable 
acceptance  among  researchers  active  in  the  field. 

5  First  Initiative 

The  basic  thrusts  of  a  current  STEM  development  initiation  effort  is  depicted  in 
Figure  1.  The  two  major  short-term  goals  are  (1)  to  improve  the  definitions  and 
understanding  of  basic  phenomenology  associated  with  STEM,  and  (2)  to  produce  a 
STEM  monograph  incorporating  the  principles  necessary  to  tackle  advanced  appli¬ 
cations.  The  point  of  view  emphasized  here  is  principally  that  of  the  EM  engineer, 
tasked  with  discerning  the  best  applications  of  STEM  to  practical  circuits  and  sys¬ 
tems. 

While  a  detailed  technical  review  of  previous  work  is  beyond  the  scope  of  this  pa¬ 
per,  a  few  selected  references  are  briefly  cited  here  so  that  the  interested  reader  can 
gain  a  better  appreciation  for  the  entries  of  Figs.  1  and  2,  Results  from  studies  and 
measurement  programs  at  NIST  [10]  -  [13]  imply  that  a  wide  variety  of  reverberation 
chambers,  more  often  called  mode-stirred  chambers  (MSCs),  share  common  EM  char¬ 
acteristics  to  the  extent  that  it  is  realistic  to  believe  a  comprehensive,  valid  STEM 
treatment  of  MSCs  can  be  developed.  An  early  report  on  statistical  distribution  of 
EM  field  amplitudes  in  complex  cavities  was  published  by  JAYCOR  [14]  in  1988, 
and  a  paper  based  on  this  report  just  recently  appeared  [15].  The  JAYCOR  report 
claims,  from  both  best-fit  empirical  case  studies  and  a  theoretical  derivation,  that  the 
power  measured  by  a  unidirectional  probe  in  a  ‘complex’  cavity  has  a  ^  distribution 
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with  two  degrees  of  freedom.  An  overmoded  MSC  statistical  model  was  reported 
in  1991  [16]  which  is  useful  for  some  cases.  The  most  ambitious  work  on  statistical 
behavior  of  fields  in  complex  cavities  published  within  the  applied  EM  engineering 
community  was  just  recently  completed  [17].  A  question  of  the  completeness  of  [17] 
for  cavities  with  specular  walls  arises  from  classical  sources  such  as  [18]  as  well  as 
more  modern  physics  literature  which  carefully  considers  the  impact  of  integrability 
along  with  boundary  conditions  (including  but  not  limited  to  [2]  -  [9]);  this  question 
is  now  under  study. 

The  “monograph  compilation  of  results”  indicated  in  Figure  1  is  expected  to 
encompass  and  report  on  theoretical,  experimental,  and  simulation  studies.  The  ob¬ 
jective  is  to  achieve  cloture  on  the  previous  work  reported  in  both  the  engineering  and 
physics  literature,  including  extensions  which  clearly  become  necessary  by  discovery 
in  the  review  process.  It  is  now  intended  that  the  scope  of  the  first  monograph  will  be 
restricted  to  the  high-frequency  regime;  modifications  of  the  theory  and  techniques 
to  apply  at  progressively  lower  frequencies  will  follow. 


Figure  1.  STEM  research  initiation  program  plan. 
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Figure  2.  Second-phase  subtasks  proceeding  from  STEM  monograph. 


6  Advanced  Applications  Research  Projects 

Various  projects  to  broaden  the  range  of  meaningful  application  of  STEM  concepts 
will  become  possible  once  a  monograph  of  first  principles  is  available.  Figure  2  indi¬ 
cates  the  three  major  subdivisions  for  second-phase  applications  research  as  presently 
envisioned:  (1)  mode-stirred  chamber  (MSC)  studies,  (2)  EM  coupling  problems,  and 
(3)  large  system  upset  analysis.  Detailed  discussion  of  these  areas  is  precluded  by 
available  space  here,  and  would  be  premature  in  view  of  the  discoveries  which  are  cer¬ 
tain  to  occur  as  development  of  the  monograph  proceeds.  Details  regarding  ad\'anced 
projects  will  be  made  available  as  the  topic  of  a  future  paper,  as  STEM  matures. 
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Abstract. 

This  paper  presents  backscattering  patterns  from  aperiodic  grids  optimized  with  genetic 
algorithms.  A  method  of  moments  algorithm  is  used  to  calculate  the  induced  surface  current  density 
on  the  strips.  The  optimization  procedure  is  very  slow,  so  several  acceleration  methods  were 
investigated. 

I.  Introduction. 

Grids  are  important  in  the  design  of  reflector  antennas  and  radomes.  When  uniform  grids  are 
very  large,  they  can  be  accurately  represented  by  infinite  periodic  grids.  Aperiodic  grids,  small  grids 
or  curved  grids  can’t  take  advantage  of  the  periodicity  of  the  grid  to  simplify  induced  current 
calculations.  An  aperiodic  grid  of  strips  consists  of  a  finite  number  of  infinitely  long  strips  lying  in 
a  plane  with  all  the  edges  parallel  to  each  other.  These  strips  are  extremely  thin  and  perfectly 
conducting,  and  may  have  different  widths  and  spacings.  By  properly  choosing  the  strip  widths  and 
spacings,  the  backscattering  patterns  from  the  grid  can  be  modified  to  achieve  a  certain  sidelobe  level 
or  a  null  in  a  given  direction. 

A  genetic  algorithm  is  used  to  optimize  the  widths  and  spacings  of  perfectly  conducting  strips 
to  obtain  the  lowest  possible  relative  sidelobe  level.  Genetic  algorithms  can  optimize  a  large  number 
of  discrete  parameters,  such  as  strips  in  a  grating.  Other  optimization  techniques  have  difficulty 
dealing  with  the  large  number  of  parameters.  Genetic  algorithms  are  ideally  suited  for  parallel 
computing  but  are  quite  slow  on  serial  machines.  This  paper  presents  some  results  of  optimizing 
aperiodic  perfectly  conducting  strips  using  MATLAB  on  a  Sun  Sparc  20  computer  and  some  attempts 
at  accelerating  the  very  slow  convergence.  Optimized  grids  have  significantly  lower  sidelobes  than 
uniform  grids. 

n.  Problem  Formulation. 

Figure  1  shows  a  model  of  an  aperiodic  grid  of  M  perfectly  conducting  strips  lying  along  the 
x-axis.  The  incident  plane  wave  has  the  electric  field  parallel  to  the  edges  of  the  strips,  and  the 
incident  magnetic  field  is  assumed  to  have  a  magnitude  of  one.  An  integral  formulation  to  find  the 
M  current  densities  is  given  by 
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^jkxcosu-^'Y^  j HQ^\k\x-x'\)dx^ 


where 

M  =  number  of  conducting  strips 
=  surface  current  density  on  strip  m 

=  zeroth  order  Hankel  function  of  the  second  kind 
k  =  2ti/X 

U  =  COS(j) 

x^'  and  =  beginning  and  ending  points  of  strip  m 


Expanding  the  current  on  strip  m  into  N  basis  functions  and  matching  the  boundary  conditions  at  p 
=  1,2,  P  =  M  X  N  points  transforms  the  integral  equation  into 


where  Ap*  and  Ap"^  are  beginning  and  ending  points  of  pulse  p.  Once  the  current  density  is  found,  the 
backscattering  is  calculated, 

A  genetic  algorithm  is  used  to  optimize  the  width  of  each  strip  in  the  grid  and/or  the  spacings 
between  strips  in  the  grid  in  order  to  produce  the  lowest  maximum  relative  sidelobe  level  in  the 
backscattering  pattern.  This  algorithm  uses  ideas  from  biology  to  determine  the  fittest  or  optimum 
grid  configuration.  Details  of  this  algorithm  can  be  found  in  [1]  and  [2]. 

in.  Results. 

The  goal  is  to  optimize  a  grid  of  40  perfectly  conducting  strips,  so  that  the  backscattering 
pattern  has  the  lowest  possible  maximum  relative  sidelobe  level.  A  genetic  algorithm  performs  the 
optimization  using  point  matching  with  five  collocation  points  per  strip.  Strip  widths  and  spacings 
are  limited  to  increments  of  0.03125^  between  0.03 125X  and  0.25X.  Five  collocation  points  per  strip 
are  used  to  characterize  the  current  density.  The  optimization  results  in  a  maximum  relative  sidelobe 
level  of  -20.08  dB.  Figure  2  shows  a  diagram  of  the  optimized  aperiodic  grid.  Note  that  both  the 
strip  widths  and  spacings  are  nonuniform.  Figure  3  shows  the  resulting  backscattering  pattern  from 
the  grid  in  Figure  2. 

One  problem  with  this  optimization  procedure  is  that  it  takes  a  considerable  amount  of  time. 
The  impedance  matrix  has  200x200  elements.  Even  though  genetic  algorithms  check  a  very  small 
portion  of  the  solution  space,  they  make  a  hundred  or  more  function  calls  every  iteration.  Every 
function  call  requires  the  decomposition  of  the  matrix  and  solving  for  the  surface  current  density  and 
the  backscatter  for  each  angle  in  the  far  field. 

flops„o„  =  Ch(M  •  N)^  +  C,„(M  •  N)^  +  NaC,3(M-N)^  +  C„(M  •  N)" 

=  (Ch  +  C,„(M  •  N)  +  +  Cff  )•  (M  •  N)^ 
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where  Q,  ,  Qj,  and  are  constants  representing 
Ch  =  matrix  fill-time 
C,u  =  lu  decomposition  of  matrix 
Cbs  =  backsubstitution 
Cff  =  far  field  calculation 

If  there  are  40  strips  and  5  collocation  points  per  strip  with  200  far  field  backscattering 
calculations 


Designing  any  large  grid  or  trying  to  design  two  dimensional  grids  becomes  unreasonable  with  such 
large  computation  times.  Consequently,  I  have  tried  three  other  approaches  to  accelerate  the  process. 
First,  I  tried  physical  optics,  then  making  the  impedance  matrix  sparse,  and  finally  using  fewer 
collocation  points. 

Physical  optics  works  well  for  modeling  resistive  strips,  but  doesn’t  model  grids  of  perfectly 
conducting  strips  well  [3].  The  same  optimization  was  performed  with  physical  optics  calculations, 
then  the  backscattering  pattern  was  calculated  using  the  MOM.  The  results  were  quite  poor. 
Interactions  between  the  strips  are  too  strong  to  ignore. 

Next,  I  tried  working  with  a  sparse  impedance  matrix.  Unfortunately,  the  coupling  between 
the  strips  is  so  strong,  that  the  matrix  is  very  full.  Eliminating  smaller  matrix  elements  makes  the 
calculations  very  inaccurate. 

Finally,  I  tried  using  only  one  collocation  point  per  strip  when  calculating  the  current  density. 
Since  the  matrix  solver  takes  on  the  order  of  (MN)^  operations,  the  matrix  solve  time  is  reduced  by 
403/2003  =  0.008.  This  alternative  works  quite  well.  Although  the  backscattering  calculations  are 
off  by  as  much  as  1  dB,  they  are  accurate  enough  to  give  excellent  results.  Better  results  could  be 
obtained  with  an  iteration  or  two  of  the  genetic  algorithm  calculating  the  current  densities  with  5 
pulses  per  strip  and  using  the  1  pulse  per  strip  result  as  a  seed.  This  acceleration  method  offers  the 
best  tradeoff  between  accuracy  and  speed. 

IV.  Conclusions. 

This  paper  showed  how  to  optimize  the  backscattering  pattern  of  a  grid  of  perfectly  conducting 
strips  by  adjusting  the  widths  and  spacings  of  the  strips.  A  40  element  grid  was  optimized  to  yield 
a  maximum  relative  sidelobe  level  of  -20  dB  in  the  backscattering  pattern.  The  genetic  algorithm 
optimization  is  quite  slow  on  serial  machines.  In  order  to  accelerate  the  convergence,  the  surface 
current  calculations  were  made  using  1  pulse  per  strip. 
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ABSTRACT 

Computational  accuracy  has  been  widely  recognized  as  a  critical  issue  in  applied  electromagnetics. 
At  the  same  time,  many  applications  involve  a  conflicting  design  goal  of  maximum  computational 
efficiency.  This  paper  summarizes  a  study  of  applications  of  massively  parallel  high  performance 
computing  to  analysis  of  electromagnetic  scattering  from  a  number  of  targets.  Specifically,  this  paper 
focuses  on  the  use  of  andvanced  numerical  techniques  and  massively  parallel  computing  in  implementation 
of  the  method  of  moments  (MOM)  to  numerical  solution  of  integral  equation  electromagnetic  scattering 
formulations.  The  goal  of  this  effort  is  to  increase  numerical  accuracy  without  sacrificing  computational 
efficiency. 

The  main  thrust  of  the  work  reported  here  is  to  explore  several  strategies  for  improving  the  accuracy 
of  method  of  moments  (MOM)  calculations  of  electromagnetic  scattering  from  metallic,  dielectric,  and 
resistive  bodies.  These  strategies,  involving  more  intensive  computation  than  less  accurate  conventional 
techniques,  are  ideally  suited  for  implementation  on  a  massively  parallel  high-performance  computer.  The 
greatly  improved  capabilities  of  the  computational  platform  coupled  with  improved  numerical  techniques 
results  in  greatly  enhanced  accuracy  while  maintaining  adequate  computational  efficiency. 

The  MOM  solution  technique  provides  numerical  solutions  at  the  cost  of  requiring  manipulation  of 
very  large  systems  of  algebraic  equations  whose  coefficients  are  often  given  in  terms  that  are 
computationally  expensive.  For  this  reason,  this  work  emphasizes  strategies  to  minimize  computation  time 
and  storage  requirements.  Some  of  these  strategies  include  adaptive  partitioning  of  the  scattering  body; 
non-uniform  partitioning  of  the  body;  and  the  usage  of  alternative  basis  functions,  including  basis 
functions  incorporating  the  correct  singular  behavior  near  edges.  The  residual  and  the  singular  value 
decomposition  (SVD)  are  used  to  diagnose  solution  convergence  and  accuracy. 

Results  to  be  presented  show  promise  of  attaining  the  goal  of  simultaneously  enhancing  accuracy 
while  maintaining  acceptable  efficiency,  especially  on  massively-parallel  platforms.  Specific  cases  to  be 
presented  include  application  of  the  strategies  outlined  above  to  the  cases  of  scattering  from  2-dimensional 
metallic  and  dielectric  scatterers. 

1 .  INTRODUCTION 

1.1  Background 

Computational  accuracy  has  been  widely  recognized  as  a  critical  issue  in  applied  electromagnetics. 
At  the  same  time,  many  applications  involve  a  conflicting  design  goal  of  maximum  computational 
efficiency.  This  paper  summarizes  a  study  of  new  techniques  implemented  on  both  scalar  and  massively 
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parallel  high  performance  computers  to  analysis  of  electromagnetic  scattering  from  a  number  of  targets. 
The  goal  of  this  effort  is  to  use  new  numerical  techniques  on  scalar  and  massively  parallel  computaters  to 
improve  computational  accuracy  without  sacrificing  computational  efficiency. 

Integral  equation  formulations  such  as  the  MOM  are  one  of  the  most  widely  used  analytical 
approaches  in  the  area  of  computational  electromagnetics,  and  are  the  formulations  utilized  in  this  work. 
Some  of  the  reasons  for  the  popularity  of  integral  equation  approaches  include  the  following;  a)  integral 
equation  formulations  are  typically  bounded,  unlike  their  differential  counterparts,  leading  to  improved 
numerical  stability;  b)  such  approaches  comprise  an  exact  formulation  of  Maxwell’s  equations,  and 
approximations  are  introduce  only  in  the  numerical  solution  of  the  exact  formulation;  c)  the 
electromagnetic  boundary  conditions  are  incorporated  in  a  natural  manner  into  the  formulation;  and  d)  a 
host  of  standard,  well-understood  techniques  exist  for  the  numerical  solution  of  integral  equations. 

The  MOM  is  noted  for  its  general  applicability  and  accuracy.  However,  these  advantages  typically 
accrue  at  the  cost  of  requiring  manipulation  of  large  systems  of  algebraic  equations  whose  coefficients  are 
often  given  in  terms  that  are  computationally  expensive.  For  this  reason,  this  work  emphasizes  strategies 
to  minimize  computation  time  and  storage  requirements.  Some  of  these  strategies  include  adaptive 
partitioning  of  the  scattering  body;  non-uniform  partitioning  of  the  body;  and  the  usage  of  alternative  basis 
functions,  including  basis  functions  incorporating  the  correct  singular  behavior  near  edges.  The  residual 
and  the  singular  value  decomposition  (SVD)  are  used  to  diagnose  solution  convergence  and  accuracy. 

The  primary  computational  platforms  used  in  this  study  include  a  Macintosh  Quadra  (68040 
processor),  a  486-DX2  PC,  a  Sun  Sparc20,  and  a  massively  parallel  DECmpp  12000  (MasPar)  computer. 
This  SIMD  computer,  together  with  five  DEC  5000  workstations,  has  been  donated  to  Florida  Atlantic 
University  by  Motorola,  Incorporated.  The  data  processing  unit  of  the  MasPar  consists  of  4096 
processors,  each  with  64  kB  of  memory,  connected  by  X-net  and  a  global  router.  The  programming 
language  used  is  Fortran  90,  with  C-I-+  available  for  later  applications. 

Results  to  be  presented  show  promise  of  attaining  the  goal  of  simultaneously  enhancing  accuracy 
while  maintaining  acceptable  efficiency.  Specific  cases  to  be  presented  include  application  of  the  strategies 
outlined  above  to  the  cases  of  scattering  from  2-dimensional  metallic  and  dielectric  scatterers. 

1.2  Typical  Results 

The  figures  below  depict  typical  results  of  several  computations  of  the  longitudinal  current 
distribution  in  the  TM  case  on  a  two-dimensional  perfectly-conducting  strip. 


PC  Execution  - 1 0  Partitions  PC  Execution  -  30  Partitions 
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The  solution  technique  used  is  the  usual  MOM  formulation  with  pulse-function  expansion  and  point 
matching.  The  two  data  sets  were  obtained  from  two  common  computers:  a  486-DX2  and  a  Sun 
SPARC-20.  In  each  case,  the  number  of  partitions  of  the  strip  (and  hence  the  number  of  unknowns  in  the 
resultant  matrix  equation)  were  gradually  increased. 

It  is  obvious  that  use  of  too  few  partitions  leads  to  an  unacceptable  "stair-step"  approximation  of  the 
actual  continuous  current  distribution.  As  the  number  of  partitions  is  increased  the  "stair-step" 
approximation  becomes  closer  to  the  expected  continuous  current  distribution.  The  residual  R  may  be 
used  to  measure  the  deviation  of  the  "stair-step"  approximation  of  the  current  distribution  from  the  actual 
continuous  current  distribution.  If  we  denote  the  exact  current  distribution  on  the  strip  as  /^(x)  and  denote 
the  MOM  approximation  to  the  current  distribution  on  the  strip  as  /^(x) ,  then  the  residual  is  defined  as 

R  =1  y^(x)  -  y^{x]  j .  It  is  most  usual  to  use  the  Euclidian  norm,  R  =  J  sj /^^(x)  -  J^^(x)  dx ,  in  computations. 

The  residual  forms  a  "figure  of  merit"  which  decreases  as  the  MOM  approximation  to  the  current 
distribution  increases  in  accuracy.  Note  that  for  partition  numbers  below  a  certain  limit,  the  residual 
decreases  as  the  number  of  partitions  increases.  This  leads  to  common  conclusion  among  users  of  the 
MOM:  a  larger  number  of  partitions  gives  a  better  result  for  the  unknown. 

However,  the  figures  show  that  this  rule  of  thumb  cannot  be  used  indiscriminately.  We  see  that  use 
of  too  large  a  number  of  partitions  results  in  current  distributions  which  are  obviously  erroneous.  A 
complicating  factor  is  also  apparent  from  the  figures:  the  Sparc20  can  handle  more  partitions  than  the  486¬ 
DX2  before  giving  spurious  results.  The  qualitative  reason  for  this  discrepancy  is  that  the  Sparc20  uses  a 
larger  binary  word-size  to  represent  numbers,  and  thus  the  round-off  errors  causing  spurious  results  for 
large  number  of  partitions  do  not  become  debilitating  until  a  larger  number  of  partitions  are  used. 

The  condition  number  C  of  the  MOM  matrix  may  be  used  as  a  figure  of  merit  in  determining  how 
nearly  singular,  and  hence  how  prone  to  round-off  error,  the  MOM  system  of  equations  is.  The  smaller 
the  condition  number  of  the  MOM  matrix,  the  less  round-off  errors  will  affect  the  solution. 

There  is  a  well-known  numerical  routine,  called  the  Singular  Value  Decomposition  (SVD),  which 
may  be  used  to  compute  the  condition  number  of  matrix.  We  make  use  of  the  SVD  routine  found  in 
LINPAC.  The  SVD  may  also  be  used  for  many  other  useful  matrix  operations.  It  can  be  used  to  invert 
the  linear  system,  it  can  be  used  to  solve  in  cases  where  the  number  of  equations  and  unknowns  are  not 
equal,  and  it  can  be  used  to  give  an  accurate  solution  even  in  cases  where  the  linear  system  is  ill- 
conditioned  (with  a  condition  number  which  is  too  large). 

Returning  to  the  figures,  we  see  that  we  have  two  conflicting  sources  of  numerical  error  in  our 
solutions:  increasing  the  number  of  partitions  reduces  the  residual,  giving  better  current  distributions,  but 
increasing  the  number  of  partitions  too  far  increases  the  condition  number  of  the  linear  system,  leading  to 
inaccurate  solutions  due  to  round-off  error. 

1.3  Goal  of  Study 

In  this  study  we  will  use  the  residual  and  the  condition  number  in  conjunction  with  the  MOM  in 
order  to  obtain  optimally-accurate  solutions  to  electromagnetic  scattering  problems.  Since  both  the  residual 
and  the  condition  number  of  a  MOM  system  may  be  computed  automatically,  the  computer  can  be 
programmed  to  alter  the  number  of  partitions  used,  and  can  itself  decide  when  the  optimal  solution  is 
achieved.  In  addition,  the  same  code  can  be  run  on  different  machines  with  differing  word-sizes,  and  the 
optimal  solution  obtained  will  be  optimal  for  the  particular  computer  used.  For  example,  on  a  486-DX2, 
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the  optimal  solution  may  be  obtained  for  90  partitions,  whereas  for  a  Sparc20  the  optimal  solution  may  be 
obtained  for  300  partitions. 

The  drawback  to  this  automated  solution  technique  is  computation  time.  The  SVD  routine  is  an 
operation,  where  N  is  the  number  of  partitions.  Then  the  solution  must  be  repeated  for  varying  numbers 
of  partitions  until  an  accuracy  figure  of  merit  (combining  R  and  C )  is  minimized.  For  this  reason,  this 
automated  solution  technique  is  ideally  suited  for  high-performance  computing.  Since  the  bulk  of  the 
computation  is  linear  algebra,  a  massively-parallel  SIMD  machine,  such  as  the  MasPar,  is  an  ideal 
computational  engine. 

2.  FORMULATION 

2.1  Discussion 

The  mathematical  formulation  used  here  is  the  standard  MOM  technique  adapted  to  the  TM  and  TE 
cases  of  field  incidence.  In  the  TM  case  the  incident  electric  field,  and  hence  the  induced  currents,  are 
along  the  axis  of  the  cylindrical  body  of  infinite  length.  In  the  TE  case  the  incident  field,  and  hence  the 
induced  currents,  are  transverse  to  the  axis  of  the  cylindrical  body  of  infinite  length. 

The  general  integral  equation  in  either  case  is  obtained  by  equating  the  total  tangential  electric  field  on 
the  conducting  body  to  zero.  Denoting  the  incident  electric  field  as  the  scattered  electric  field  as  , 
and  a  unit  tangent  vector  to  the  conductor  as  f ,  we  demand  t  -  =  -h  E  ,  In  terms  of  the  current  /(yd) 

on  the  cylinder,  this  gives  the  integral  equation 

i-(k^  +  vv  ■) G{p,  p')](p')  dS  =  -  jC0£  t  ■  e\P),  pes 

Here  G{p,  p')  is  the  two-dimensional  unbounded-space  Green’s  function  G{p,  p')  =  ^  p-p'\\  ■ 

This  equation  will  be  specialized  to  the  TM  and  TE  cases  in  numerical  implementation.  We  begin 
with  the  TM  case. 

2.2  TM  Case 

The  above  integral  equation  simplifies  for  TM  incidence  to 
G{p,  p')  Jip’] dS'  =  -  jcoeEiip),  peS 

We  will  analyze  the  simplest  case  of  scattering  from  a  flat,  perfectly-conducting  strip  of  width  vv .  Using 

N 

pulse-function  expansion  and  point-matching  results  in  the  linear  system  =  /  =  1,---,A^ .  Here 

the  matrix  elements  and  forcing  vector  are  given  by 


A,= 


^  I  - ;•)],  i^j  ,  , 

^  [l-filog(^)].  i-i  ‘  ^ 
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This  case  was  implemented  on  a  486-DX2  and  a  Sun  SPARC-20  with  the  results  shown  above. 

2.3  TECase 


The  above  integral  equation  simplifies  for  TE  incidence  to 
Gip,  p]  jm  dS'  -  If  G{p  p]  pm  dS'  =  -  jcoeEiiPl  p  e  5 


We  will  again  specialize  to  the  simplest  case  of  TE  scattering  from  a  flat,  perfectly-conducting  strip  of 
width  w .  Using  triangle-function  expansion  and  pulse-function  matching  results  in  the  linear  system 

N 

y,  A- JC=b;  ■  Here  the  matrix  elements  and  forcing  vector  are  more  complicated,  and  will  not 

j=i  ^  ‘ 


be  shown. 

This  case  was  implemented  on  a  Macintosh  Quadra  and  a  Sun  Sparc20  with  results  very  similar  to 
those  of  the  TM  case. 


2 . 4  Adaptive  Uniform  Partitions 

The  figure  of  merit  formed  by  the  residual  and  the  condition  number  of  the  linear  system  may  be 
used  to  implement  an  adaptive  partitioning  scheme.  Such  a  code  could  find  the  solution  of  optimum 
accuracy  on  any  given  machine.  The  code  would  begin  with  a  coarse  partitioning  of  the  scatterer,  such  as 
=  15  ,  and  increase  the  number  of  partitions  by  ten  in  each  loop.  By  monitoring  the  figure  of  merit,  the 
number  of  partitions  would  keep  increasing  until  the  residual  grows  too  large  and  the  condition  number  of 
the  linear  system  increases  too  far. 

The  adaptive  code  would  increase  the  number  of  partitions  to  approximately  90  on  the  486-DX2,  to 
approximately  150  on  the  Macintosh  Quadra,  and  to  over  1000  on  the  Sparc20. 

2.5  Adaptive  Non-Uniform  Partitions 

The  condition  number  of  the  matrix  of  the  MOM  system  becomes  large  when  rows  of  the  matrix 
become  nearly  equal.  This  problem  occurs  in  those  rows  of  the  MOM  matrix  which  correspond  to 
expansion  coefficients  which  are  nearly  equal  and  are  far  from  the  edges  of  the  scatterer  (7  =  1  and 
j  =  N).  At  such  locations  the  current  distribution  is  slowly  varying  compared  to  the  current  variation  near 
the  edges  of  the  structure. 

A  method  of  improving  the  accuracy  of  the  solution  while  maintaining  or  reducing  the  number  of 
partitions  exists.  This  technique  is  to  use  larger  partition  sizes  at  locations  of  slowly-varying  currents,  and 
smaller  partition  sizes  at  locations  of  rapidly-varying  currents  (such  as  edges).  We  are  in  the  process  of 
implementing  this  strategy,  and  numerical  results  will  be  available  for  presentation  shortly. 

2.6  Singular  Edge  Basis  Functions 

Another  strategy  for  improving  the  accuracy  of  the  solution  without  increasing  the  number  of 
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partitions  is  to  use  appropriate  basis  functions  near  scatterer  edges.  There  are  well-known  results  for  the 
singular  behavior  of  longitudinal  currents  near  edges  and  angular  joints  in  perfect  conductors.  The  basis 
functions  at  these  locations  may  be  chosen  to  incorporate  the  correct  edge  singularities  in  such  cases.  The 
fact  that  the  basis  function  models  the  actual  current  better  than  pulse-functions  or  triangle-functions 
should  allow  for  the  use  of  fewer  basis  functions  near  these  edges,  thus  improving  accuracy  while 
reducing  the  order  of  the  resultant  linear  system. 

3.  IMPLEMENTATION 

3.1  Scalar  Implementation 

The  implementation  of  the  adaptive  uniform  partitioning  scheme  for  the  case  of  TM  and  TE  scattering 
from  a  perfectly-conducting  strip  is  complete.  Implementation  of  the  technique  of  non-uniform  adaptive 
partitioning  is  proceeding,  and  initial  results  look  promising.  Final  results  will  be  available  shortly. 

The  formulation  involved  in  utilization  of  singular  edge  basis  functions  is  underway.  Initial 
numerical  results  of  this  technique  should  be  forthcoming  shortly. 

3.2  Massivelv-Parallel  Implementation 

Initial  results  for  implementation  of  the  adaptive  uniform  partitioning  scheme  on  the  MasPar  have 
been  obtained.  Porting  the  scalar  code  to  the  MasPar  has  been  greatly  facilitated  by  the  extensive  linear 
algebra  libraries  which  exist  for  the  machine  (even  via  the  Internet).  The  most  computationally-intensive 
portion  of  the  code,  calculating  the  Singular  Value  Decomposition,  exists  as  subroutine  ready  to  use. 

To  date  we  have  ported  our  code  to  the  MasPar  with  only  the  SVD  subroutine  executing  in  Fortran 
90.  This  means  that  much  of  the  overhead  code  is  executed  on  the  DEC  5(X)0  front-end,  and  only  the  most 
numerically-intensive  linear  algebra  portions  (such  as  SVD)  execute  on  the  Data  Procession  Unit  (DPU). 

Other  portions  of  the  code  could  benefit  from  parallel  execution,  and  these  segments  are  currently 
being  re-written  for  execution  on  the  DPU.  Initial  results  of  parallel  execution  show  the  expected 
dramatically-decreased  time  of  execution,  but  the  optimal  number  of  partitions  in  the  MasPar  is  quite  small. 
Initial  results  indicate  that  the  optimal  number  of  partitions  is  on  the  order  of  100,  comparable  to  the  results 
of  the  486-DX2.  The  probable  reason  for  this  is  that  each  of  the  4096  processors  in  the  DPU  are  relatively 
simple,  unlike  the  Alpha  processors  in  the  Sparc20. 

As  the  more  complex  versions  of  the  code  are  completed  for  scalar  execution,  they  will  be  adapted 
for  parallel  execution  on  the  MasPar. 

4.  CONCLUSIONS 

4.1  Results  to  Date 

The  final  results  obtained  to  date  indicate  that  an  adaptive  uniform  partitioning  strategy  could  provide 
a  valuable  tool  in  the  accurate  MOM  analysis  of  scattering  from  two-dimensional  conducting  bodies  in  a 
multi-platform  environment.  The  optimal  results  will  be  obtained  regardless  of  the  machine  utilized  in  the 
computation. 

The  adaptive  uniform  partitioning  code  has  been  ported  to  the  MasPar  with  only  the  SVD  subroutine 
executing  in  Fortran  90.  Incremental  modification  of  the  remainder  of  the  code  is  underway  to  reduce  the 


222 


amount  of  overhead  code  that  is  executed  on  the  DEC  5000  front-end,  and  total  code  execution  on  the 
(DPU)  should  be  forthcoming  shortly.. 

The  non-uniform  adaptive  partitioning  technique  is  nearly  complete  for  scalar  execution.  Scalar  and 
parallel  results  should  follow  easily.  Incorporation  of  singular  edge  basis  functions  is  in  the  formulation 
stage.  Initial  results  should  be  available  in  the  near  future. 
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Abstract 

For  the  past  five  years  the  U,  S.  Coast  Guard  Academy  has  been  conducting  research  on 
shipboard  antenna  patterns  using  scaled  wood  models  of  Coast  Guard  Cutters  painted  with  a  conductive 
coating.  Using  a  comer  of  the  Antennas  Laboratory  we  set  up  a  anechoic  chamber  with  a  horizontal 
turntable.  The  results  from  the  models  tested  have  correlated  well  with  the  test  results  from  actual  cutters 
and  with  the  Numerical  Electromagnetic  Code  (NEC)  approximations. 

Last  year,  as  a  senior  year  undergraduate  design  course  project,  the  Academy  was  asked  to 
investigate  the  possibility  of  using  Near  Vertical  Incidence  Skywaves  (NVIS)  as  a  method  of  improving 
the  110’  WPBs  (patrol  boat)  HF  communications  capabilities.  To  complete  this  study,  we  needed  a  way 
to  measure  the  radiation  pattern  of  the  antenna/ship/sea  surface  system.  We  had  previously  observed 

patterns  only  as  a  function  of  equatorial  angle,  <]),  with  the  polar  angle,  9,  equal  to  approximately  85 
degrees.  Other  researchers  have  used  an  arch  system  constructed  over  the  ground  plane.  We  attempted 
to  use  this  technique,  but  results  were  unsatisfactory.  We  came  up  with  a  creative  way  of  solving  the 
problem.  Instead  of  moving  the  illumination  antenna  over  the  model,  we  discovered  that  we  could  move 
the  model  and  ground  plane  while  leaving  the  illuminating  antenna  fixed.  We  constructed  a  lightweight 
ground  plane  Aat  could  be  mounted  vertically.  The  model  was  attached  to  the  ground  plane,  and  the 
entire  system  was  rotated.  By  changing  the  aspect  of  the  model  on  the  ground  plane  through  90°,  we 
were  able  to  collect  pattern  cuts  over  the  entire  hemisphere  above  the  "sea  surface". 

The  results  from  the  testing  showed  that  the  rotating  ground  plane  worked  and  that  NVIS 
showed  some  promise  for  the  110'  patrol  boats.  Funding  for  work  on  the  project  was  continued.  This 
year,  a  second  student  will  test  alternative  NVIS  antenna  designs,  and  investigate  HF  circuit  behavior 
using  the  "Prophet"  propagation  prediction  software.  Finally,  we  anticipate  implementing  the  most 
promising  antenna  design  on  an  actual  Coast  Guard  patrol  boat 

It  is  our  hope  that  the  ideas  presented  here  will  be  of  value  in  the  undergraduate  Antenna 
Laboratory  to  help  students  better  understand  the  idea  of  a  three-dimensional  antenna  radiation  pattern. 
We  have  found  the  technique  helpful  in  our  research  and  an  inexpensive  solution  to  the  problem  of 
observing  the  electromagnetic  field  over  the  ground  plane. 

Background 

Our  normal  method  of  measuring  antenna  patterns  has  been  to  place  the  ship  model  on  a  copper 
topped  turn  table  four  feet  in  diameter.  This  table  top  rotates  through  3^°.  We  illuminate  the  model  and 
table  with  a  transmitting  antenna  outside  the  anechoic  comer,  and  record  the  magnitude  of  the  signal 
received  on  the  model  antenna.  This  technique  gives  us  azmuthal  information,  but  no  elevation 
information.  Moving  the  illuminating  antenna,  while  possible  presents  new  problems  with  in-roora 
reflections  since  our  chamber  is  only  one  comer  of  our  Antennas  Laboratory  room. 
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To  obtain  elevation  information  from  our  models,  one  possible  approach  is  to  construct  an  arch 
over  the  rotating  table  top.  We  could  then  illuminate  the  model  with  an  antenna  that  moved  over  the 
arch.  This  technique  is  used  by  the  antennas  group  at  the  Naval  Undersea  Warfare  Center  in  New 
London,  CT. 

A  couple  years  ago  we  constructed  such  an  arch  to  measure  elevation  information  from  a  simple 
monopole  antenna.  After  repeated  attempts,  we  concluded  that  the  feed  cable  for  the  ^ch-mounted 
antenna  was  contaminating  the  results,  and  that  this  arch  method  would  not  woiic  in  our  simple-minded 
antenna  chamber.  To  develop  a  more  refined  system  using  an  arch  arrangement  over  the  ground  plane 
would  require  the  receiver  or  transmitter  to  ride  on  the  arch.  This  sounded  expensive  and  more  complex 
than  we  were  willing  to  implement 

Azmuthal  patterns  in  the  elevation  plane 

During  discussions  with  the  co-authors,  our  student  asked  the  silly  question,  "Why  not  turn  the 
table  on  edge  and  rotate  the  whole  diing  in  the  plane  that  we  usually  rotate  the  anteimas  in  the  chamber?" 
We  thought  about  this  ridiculous  suggestion  for  a  couple  minutes,  and  then  we  decided  that  the  idea  was 
perhaps  not  so  far  out  of  the  plane  of  possibilities. 

Design  Considerations 

Developing  a  functioning  ground  plane  required  us  to  consider  four  major  criteria.  First,  the 
entire  system  had  to  be  light  enough  to  be  turned  by  our  existing  rotating  system.  This  system  was 
capable  of  turning  a  weight  of  less  than  25  kg's.  Second,  we  needed  a  ground  plane  that  would  be  rigid 
enough  so  that  it  would  not  flex  during  rotation.  We  were  able  to  determine  that  starting  and  stopping 
were  the  times  of  largest  flex.  The  tiiird  problem  area  we  addressed  was  how  to  change  the  orientation 
of  the  model  so  that  we  could  observe  elevation  pattern  information  at  different  azimuAs  with  respect  to 
the  bow  of  the  ship.  The  final  problem  to  be  resolved  was  how  to  defy  the  law  of  gravity  and  keep  the 
ship  model  firmly  in  contact  with  the  ground  plane  surface.  On  the  table  this  had  not  b^n  a  problem 
because  gravity  was  very  cooperative. 

Viewed  from  the  completed  project  side  of  this  effort,  many  of  the  aspects  of  this  solution  seem 
obvious.  While  there  is  clearly  not  too  much  rocket  science  included  in  our  solution  to  the  problem,  I 
believe  that  the  student  was  creative  in  viewing  the  entire  problem  from  a  new  angle.  A  sketch  of  the 
final  results  is  presented  in  Figure  1.  While  a  thousand  words  have  been  saved  by  using  this  figure, 
allow  me  to  point  out  some  important  details.  First,  note  that  while  one  would  first  expect  that  we  could 
have  to  rotate  the  model  180®  on  the  ground  plane  to  get  an  entire  set  of  elevation  patterns  for  the  model, 
after  carefully  trying  a  couple  orientations,  one  quickly  can  see  that  we  only  need  to  move  the  model 
through  90°  to  get  complete  elevation  information.  One  other  consideration  which  is  obvious  to  us,  but 
not  necessarily  to  the  student  at  first  is  the  requirement  that  all  hardware  holding  the  ship  in  place  should 
be  non-conducting  material  so  as  to  not  interfere  with  the  antenna  radiation  pattern.  Developing  a 
clamping  arrangement  that  would  hold  the  model  firmly  against  the  ground  plane  and  made  out  of  wood 
was  rather  challenging.  Our  electronics  technician  did  a  wonderful  job  fabricating  the  clamps  and 
wooden  bolts  that  are  now  used. 

Advantages  of  this  approach 

There  are  four  advantages  to  this  approach  based  on  the  anechoic  chamber  that  we  have  at  the 
Coast  Guard  Academy,  First  of  all,  we  were  able  to  use  the  existing  instrumentation  that  we  use  for 
normal  equatorial  patterns.  Second,  the  solution  was  inexpensive  to  implement.  Third,  we  use  the  same 
software  that  we  use  for  azmuthal  pattern  collection  since  we  are  using  the  same  rotation  system  that  we 
have  used  previously.  Fourth,  the  entire  ground  plane  and  model  system  attached  to  a  fiberglass  pole 
and  can  be  installed  and  removed  by  no  more  than  two  people. 
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Figure  1.  Sketch  of  vertical  ground  screen  with  ship  model  mounted. 

This  orientation  will  give  a  pattern  cut  over  the  top  of  the  ship 
at  an  azmuthal  angle  of  about  40°  off  of  the  starboard  bow  to 
40°  off  of  the  port  side  of  the  stem. 

Disadvantages  we  have  found 

One  major  disadvantage  we  have  found  is  securely  clamping  the  model  to  the  ground  plane.  At 
some  aspect  angles  the  model  stays  in  place  well,  but  in  other  orientations  the  model  wants  to  jump  out 
of  the  holding  blocks.  Our  simple  solution  for  this  problem  is  to  tie  the  model  in  place  with  a  piece  of 
string.  A  second  less  significant  disadvantage  is  that  the  ground  plane  rotates  through  360°.  When  the 
ground  plane  is  between  the  model  and  the  transmitting  antenna,  the  information  is  meaningless.  We 
choose  to  ignore  this  part  of  the  data  and  present  data  observed  when  the  antennas  on  the  ship  model  are 
"above  the  horizon." 

Experimental  Results 

Figure  2  presents  the  results  of  testing  the  ship  model  on  the  vertical  ground  screen  in  our 
anechoic  chamber.  Tests  were  conducted  at  a  scaled  frequency  of  6.0  MHz  and  we  used  four  different 
model  orientations  of  the  model.  The  first  pattern  (0°)  was  mn  with  the  model  oriented  so  that  the 
pattern  cut  from  from  bow  to  stem  over  the  top  of  the  model.  We  then  did  plus  and  minus  45°  and  the 
90°  cut  was  what  we  sailor  call  the  athwartship  pattern  cut.  Points  that  became  important,  from  the 
student's  perspective  were  the  overhead  points  and  points  at  the  horizon.  Initially,  Mr.  Nutter  collected 
the  antenna  patterns  on  different  days,  and  did  not  attempt  to  normalize  anything  to  what  he  had 
observed  earlier.  When  the  patterns  were  superimposed,  he  concluded  that  his  data  was  less  than  perfect 
because  the  patterns  failed  to  align  at  these  points.  Again,  while  obvious  to  us,  to  the  student,  this  point 
became  a  key  to  understanding  of  what  was  really  going  happening  with  the  patterns.  We  reran  the 
patterns,  paying  careful  attention  to  the  signal  levels  and  equipment  settings,  and  the  results  agreed  as 
can  be  seen  in  Figure  2. 
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Figure  2.  Measured  radiation  patterns  from  1 10'  patrol  boat  model  at 
scaled  frequency  of  6.1  MHz.  and  at  angles  of  -45°,  0°,  45°, 
and  90°  with  respect  to  the  ship's  bow.  Notice  agreement  at 
the  horizon  and  directly  overhead. 

Figure  3  presents  the  results  of  one  NEC  run  of  our  model  at  a  frequency  of  6.0  MHz  with  the  pattern 
cut  set  for  "over  the  keel"  or  0°  from  stem  to  bow.  Based  on  general  shape  of  the  pattern,  we  were  quite 
excited  about  these  results.  Figure  4  shows  a  comparison  of  the  data  from  the  anechoic  chamber  and  the 
NEC  generated  pattern  on  the  same  axis.  Obviously,  we  have  some  magnitude  disparities  that  mi^t  be 
resolved.  One  immediate  observation  that  we  make  is  that  generally,  the  shape  of  die  patterns  is  similar. 
A  second  observation  is  that  the  NEC  pattern  is  assuming  a  perfect  ground,  while  our  vertical  ground 
screen  is  not  a  perfect  ground,  but  appears  to  be  more  like  we  would  expect  from  a  sea  surface. 
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Figure  3.  Radiation  pattern  predicted  by  NEC  at  a  frequency  of  6.0  MHz 
from  bow  to  stem  over  the  top  of  the  model  (0°  azmuth). 
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Figure  4.  Comparison  of  the  ratiation  patterns  measured  in  our  anechoic 
chamber  and  predicted  using  NEC  code.  Note  especially  the 
similarity  in  general  shape.  The  differences  in  levels  at  the 
horizon  are  still  being  investigated. 


Conclusions 

A  light  weight  rotatable  ground  plane  was  constructed  that  would  hold  a  model  ship  vertically  so 
that  elevation  antenna  patterns  could  be  observed  by  rotating  the  ground  plane  and  ship  azrauthally.  By 
changing  the  orientation  of  the  ship  model  on  the  ground  plane  pattern  cuts  at  different  angles  are 
recorded.  Patterns  observed  on  this  ground  plane  have  been  compared  to  numerical  results  generated  by 
the  Numerical  Electromagnetic  Code.  The  patterns  from  both  methods  agree  favorably.  This  simple 
technique  will  be  used  more  as  we  continue  our  research  into  antenna  characteristics  on  Coast  Guard 
Ships.  This  rotating  vertical  ground  plane  is  a  simple  method  of  observing  antenna  elevation  patterns 
without  resorting  to  expensive  arch  systems  with  the  accompanying  arch  mounted  probes. 
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Abstract 

Conjugate  gradient  EFT  (CGFFT)  methods  have  previously  been  reported  for 
inhomogeneous  dielectric  scatterers  and  planar  conductors  and  both  far-fields  and 
current  distributions  on  the  scatterer  have  been  included.  This  paper  will  explain  how 
the  near-fields  can  be  efficiently  determined  from  such  methods.  A  comparison  of 
near- field  results  will  be  given  with  method  of  moment  solutions  for  mixed 
dielectric/pec  scatterers  in  the  resonance  region. 

INTRODUCTION 

Conjugate  gradient  FFT  (CGFFT)  methods  have  previously  been  reported  to 
solve  the  electric  field  integral  equation  (EFIE)  for  scatterers  in  the  resonance  region. 
In  common  with  most  method  of  moment  (MoM)  schemes  CGFFT  methods  usually 
solve  for  the  current  distribution  on  the  scatterer  from  which  far-fields  can  be 
determined  with  negligibly  CPU  costs  via  standard  techniques. 

The  determination  of  near-fields  in  a  local  region  to  the  scatterers  is  not  as 
straightforward  as  that  for  the  far-fields.  In  the  MoM  schemes  the  impedance  matrix 
has  to  be  recalculated  and  this  along  with  further  matrix  multiplications,  can  take  a 
significant  amount  of  CPU  for  a  scatterer  in  the  resonance  region.  This  procedure 
would  need  to  be  repeated,  possibly  many  times,  to  “see”  the  whole  near- field  region 
associated  with  a  scatterer  and  would  rapidly  exhaust  most  computing  facilities. 

This  paper  will  discuss  how  near-fields  can  be  efficiently  determined  via 
CGFFT  methods  [1,  2,  3]  at  a  fraction  of  the  computing  costs  associated  with  standard 
MoM  schemes.  Near-field  plots  which  have  been  locally  validated  against  a  MoM 
code  which  solves  the  combined  field  integral  equation  (CFIE)  [4],  will  be  given  for  a 
range  of  dielectric  and  mixed  dielectric/pec  scatterers  in  the  resonance  region. 
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EVALUATING  NEAR-FIELDS 


There  are  basically  3  classes  of  scatterer  to  which  the  CGFFT  schemes  have 
been  applied,  namely  inhomogeneous  dielectrics  [2]  resonant  pec  plates  [1]  and  mixed 
dielectric/pec  scatterers  [3].  The  computation  of  the  near-fields  is  similar  for  each  of 
these  schemes  and  will  be  discussed  for  the  mixed  dielectric/pec  scatterer  problem. 
The  scheme  uses  a  pulse-basis  formulation  to  solve  the  EFIE  and  takes  the  form 


Ga 

_ 1 

Gdd. 

Jd. 

1 _ 1 

where  Ec‘  and  Ed'  are  the  known  incident  fields  on  the  conductor  and  dielectric  regions 
respectively,  L  is  the  unknown  induced  current  on  the  pec  surfaces  and  Jd  the  unknown 
polarization  current  in  the  dielectric.  The  Green’s  dyadic  terms  in  the  impedance 
matrix  represent  the  self  and  mutual  coupling  between  the  different  regions.  The 
significance  of  the  CGFFT  scheme  is  that  the  FFT  of  each  of  the  blocks  in  the 
impedance  matrix  requires  only  a  storage  of  order  N,  where  N  is  the  number  of 
unknowns,  whereas  the  equivalent  blocks  in  the  spatial  domain  of  MoM  schemes 
typically  require  a  storage  of  order  N“  and  N  can  be  very  large  for  a  resonant  structure. 

Once  a  solution  to  (1)  has  been  computed  the  matrices  Gcc  etc  can  be  readily 
expanded  to  etc  with  a  regular  mesh  into  the  whole  of  the  near-field  region. 

The  current  matrices  are  also  expanded  filling  in  the  region  outside  the  scatterer  with 
zeros.  Thus  whole  regions  of  the  scattered  near- field  Es,  can  be  computed  using 
expanded  versions  of  either  the  top  or  bottom  row  of  (1),  eg.  using  the  top  row: 


(2) 


RESULTS 

Near-fields  will  be  shown  from  several  mixed  dielectric/pec  scatterers  and  these 
will  be  compared  to  the  results  from  the  MoM  applied  to  the  C.F.I.E.  A  typical 
example  of  the  computed  near-field  is  shown  for  the  scatterer  given  in  Fig.  1  which 
comprises  a  dielectric  cuboid  centrally  placed  in  contact  with  a  pec  plate  in  the  x-y 
plane.  The  scatterer  is  illuminated  along  the  -z  axis  with  an  x-polarised  plane  wave. 

Fig.  2  shows  the  magnitude  of  the  x-component  of  a  section  of  the  scattered 
electric  field  for  the  case  of  a  0.2?io  dielectric  cube  of  permittivity  =  9  and  a  square 
pec  plate  of  side  0.4A,o.  The  section  is  taken  perpendicular  to  the  y-axis  through  the 
centre  of  the  scatterer.  The  plot  clearly  identifies  the  scattering  centres  associated 
with  the  edges  of  the  pec  plate  and  the  faces  of  the  cube.  A  uniform  grid  mesh  of 
0.033^0  was  used  results  inal2xl2x6  solution  domain  for  the  CGFFT  method. 
When  the  solution  had  been  determined  the  domain  was  expanded  with  the  same  mesh. 
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Dielectric  block 


to  18x  18x  12  to  compute  the  near-field  surrounding  the  whole  scatterer.  Views  of 
the  near-field  as  given  in  Fig.  2  are  then  only  restricted  by  the  graphics  available  to  the 
user  and  further  views  require  no  more  computations. 
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1.  Introduction. 

For  many  applications  of  electromagnetic  theory  -  from  radio  propagation  to  X-ray  optics,  it  is 
extremely  useful  to  have  a  detailed  quantitative  and  visual  representation  of  the  wave  field  produced 
by  large  quasioptic  devices  being  designed  and  used  as  antennas,  focusing  lenses  or  elements  of 
transmitting  lines.  An  effective  tool  for  interactive  computer  aided  design  (CAD)  requires  selection 
of  a  powerful  and  logically  simple  computational  algorithm  and  its  implementation  into  a  flexible 
user-friendly  computer  code.  Such  program  has  to  permit  easy  change  of  the  designed  system 
configuration  and  parameters  and  immediate  display  of  the  effects  of  each  modification. 

Quite  often,  the  use  of  sophisticated  analytical  methods  or  time-  consuming  straightforward 
numerical  algorithms  does  not  allow  to  reach  the  desirable  effectiveness.  A  good  opportunity  to 
increase  the  efficiency  of  field  calculations  offer  the  algorithms  based  on  the  parabolic  equation 
method.  The  parabolic  equation  (PWE)  for  a  slowly  varying  wave  amplitude  can  be  solved  numer¬ 
ically  much  more  fast  then  the  exact  wave  equation.  This  stimulates  an  extensive  use  of  the  PWE 
in  computer  aided  design  of  EM  quasioptic  systems. 

We  demonstrate  a  computer  code  for  IBM-compatible  PC’s  modeling  a  wide  class  of  quasioptical 
systems  composed  from  lenses,  Fresnel  zone  plates  and  similar  dielectric  elements.  Such  software  can 
find  application  in  a  wide  range  of  EM  spectrum  including  antennas  for  microwave  communication 
and  remote  sensing  systems,  binary  optics,  X-ray  microscopy  of  biological  objects,  education  and 
training  in  optics  and  diffraction  theory,  etc. 

The  mathematical  algorithm  includes: 

•  finite-difference  approximation  of  the  PWE 

•  implementation  of  non-local  boundary  conditions  simulating  perfectly  transparent  boundaries 

•  a  priori  applicability  criteria  of  the  parabolic  equation  method 

•  practical  estimates  for  the  finite- difference  mesh  parameters 

The  computer  code  written  in  C-f-f  programming  language  provides: 

•  interactive  design  and  display  of  the  quasioptic  system  to  be  simulated 

•  a  restricted  library  of  predefined  optical  elements 

•  numerical  solution  of  the  PWE 

•  color  graphical  display  of  the  calculated  field  intensity 

•  interactive  calculation  regime  including  breakpoints  and  field  storage. 
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2.  Mathematical  Algorithm 

We  consider  diffraction  of  wave  packets  with  narrow  angular  spectrum  in  weak  nonuniform 
media  with  small  permittivity  variations  |£  -  1|  <  1.  To  simplify  calculations,  we  use  ,  instead  of 
the  exact  wave  equation 

d^E  d^E  d^E 

the  parabolic  wave  equation  (PWE)  [1,2] 

+  Axu  +  [£(x,2/,2r)E]  u  =  0  (2) 

oz 

for  the  slowly  varying  complex  amplitude 

U{x,y,z)  =  E[x,y,z)e~"^^  (3) 

Being  an  asymptotic  approximation  to  (1),  the  PWE  (2)  describes  adequately  the  main  diffraction 
phenomena  (’’transversal  diffusion”  [3])  in  the  nonuniform  dielectric  medium  as  well  as  the  free- 
space  propagation  and  focusing  outside  the  optical  elements.  Formally,  the  applicability  conditions 
of  the  PWE  (2)  can  be  obtained  by  putting  e  -  \  =  a(x^y^z)  =  a  (f;  ^’f)  ^  ^ 

characteristic  scales  of  transversal  and  longitudinal  nonuniformity  of  the  dielectric  permittivity. 
Estimation  of  the  neglected  terms  yields: 

kb\  a\<  1,  —  >1  (4) 

0 

These  conditions  are  usually  met  for  plasma  media  and  X-ray  optical  elements.  In  practice,  however, 
the  main  averaged  features  of  the  wave  held  can  be  described  correctly  even  under  less  restrictive 
assumptions. 

If  the  Fresnel  parameter  ka?  jh  proves  to  be  large,  it  is  possible  to  omit  in  (2)  the  diffusion  term 
Au  which  leads  to  the  plane  screen  approximation 


K  f 

=  u(x,t/,0)exp  i- j  cx{x,y,z)dz  s  T{x,y)u{x,y,0) 


for  a  quasioptical  element  localized  between  z  ~  0  and  z  ~  b. 

Outside  the  optical  element,  where  a  =  0,  the  PWE  takes  the  form 


du 

2ik-z — f-  Au  =  0 
oz 


and  has  a  general  solution 


ti(x,  y,z)  =  JJ  u{x',  y',  0)  G{x  -  x',y  -  y',  z)  dx'dy' 


r(  '  •  \  ^  \.Ax-  x'f  +  {y-y'? 

G{x-x,y-y,z)  =  —  ^xp  rh - - 
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Evidently,  (7)  is  nothing  but  a  paraxial  approximation  of  the  Kirchhoff  integral 


h  r  r 

E{x,  2)  =  ^  //  y'> 


However,  generally,  the  complete  PWE  (2)  is  to  be  used  to  take  into  account  the  diffraction  processes 
inside  the  element  body. 

Transition  from  rigorous  wave  theory  to  the  transversal  diffusion  approximation  leads  to  the 
change  of  kind  of  the  differential  equations  and  to  a  new  formulation  of  the  boundary  value  prob¬ 
lem.  In  contrast  to  the  elliptic  wave  equation,  the  PWE  (2)  describes  the  evolution  of  the  wave 
amplitude  in  process  of  almost  unidirectional  propagation  along  the  optical  axis.  Physically,  it 
means  neglecting  the  backward  reflections  from  the  interfaces  and  the  waves  diffracted  into  large 
off-axis  angles  (such  approximation  is  adequate  for  the  variations  of  the  refraction  index  are  sup¬ 
posed  to  be  small). 

For  the  parabolic  equation,  the  Cauchy  problem  is  correct  with  a  given  initial  distribution 
u(a;,y,0)  =  UQ{x,y)  and  with  some  radiation  condition  excluding  spurious  waves  coming  from 
infinity.  In  the  process  of  the  numerical  solution  of  the  problem  demanding  truncation  of  the  spatial 
domain,  one  has  to  seek  a  way  of  transferring  the  radiation  condition  from  infinity  to  a  certain  finite 
surface  surrounding  ail  the  sources  and  diffractive  elements.  Since  any  spurious  reflection  from  this 
artificial  boundary  would  change  the  wave  field  structure,  such  boundary  condition  has  to  provide 
full  transparency  for  arbitrary  radiation  from  inside. 

In  a  model  two-dimensional  case,  an  exact  form  of  the  transparency  condition 

^ix,±A)  =  ^e-'^/‘^^2k-^u{x,±A) 

has  been  found  in  [4,5] . 

In  three  dimensions,  we  transfer  the  radiation  condition  from  infinity  to  an  arbitrary  cylindric 
surface  r  =  A  surrounding  all  the  dielectric  elements  using  the  following  generalization  of  [4,5]: 


du  Id 


Z  ZTT 

I  j  u{A,i>,QK{^-  ()d(d^ 


Hm\A\/2ikp)  y/p 


In  the  most  interesting  axial  symmetric  case,  only  the  term  m  =  0  remains  in  (11)  .  Moreover, 
as  usually  the  integration  region  is  very  wide  compared  with  the  wavelength  {kA  >1),  the  Hankel 
function  can  be  replaced  by  its  far-field  asymptotic  expression  Then 

the  zero-order  term  in  (11)  can  be  calculated  explicitly: 


which  reduces  the  boundary  condition  (10)  to  the  form 
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coinciding  the  result  [4,5]. 

Confining  ourselves  to  the  axial  symmetric  case,  we  use  the  following  six-point  implicit  finite- 
difference  scheme 

+  <+i  - 

zilc  ^  +  2/12 

,  “mVl  “  +  “m+1  “  “m-1  ,  7.2  n+l/2“m'*'^  +  “m  _  n  (14^ 
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reducing  the  PWE  (2)  to  a  three-diagonal  set  of  linear  algebraic  equations.  Here,  r  and  h  are  the 
mesh  steps  in  z  and  r  directions,  and  ti"  sa  u{nT,mh).  As  the  coefficient  a{r,z)  =  e  -  1  may  have 
discontinuities  at  the  interfaces,  a  better  performance  shows  the  modified  scheme  where  coefficient 
is  replaced  by  its  weighted  average  value 

(n+l)i'  (m+l/2)/i 
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given  by  the  integral-interpolation  method.  The  linear  equations  (14)  written  for  m  —  0, 1, . . . ,  M 
A//i,  along  with  the  finite- difference  approximation  of  the  transparency  condition  (13) 


form  a  complete  set  of  equations  which  can  be  solved  step  by  step,  from  n  to  n-}- 1,  by  the  marching 
method.  The  energy  balance  law 

M  M 

m=0  m—0 

grants  convergence  and  stability  of  the  scheme  (14). 

3.  Accuracy  Estimation 

An  elementary  analysis  of  accuracy  of  the  PWE  approximation  can  be  made,  following 
G.Malyuzhinets,  by  comparing  an  exact  2D  plane  wave  representation 


E{z,y)  =  e' 


1-P^) 


with  its  ’’parabolic”  analog 


,  p  =  sin  j3 
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which  satisfies  the  PWE  (6).  It  is  clear  that  the  exponent  in  (19)  corresponds  to  the  first  two  terms 
of  the  Taylor  series 
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Demanding  the  inaccuracy  in  phase  be  small  compared  with  tt ,  one  gets  an  applicability  criterion 
of  the  PWE  : 

P'^Zmax  <  4A  (21) 
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It  bounds  the  angle  spectrum  of  the  waves  accurately  described  by  the  parabolic  equation  at  a 
given  distance  ^marr-  A  more  precise  approximation  of  the  square  root  in  (4.7)  would  result  in  a 
wide-angle  or  long-range  modification  of  the  PWE. 

The  same  approach  can  be  applied  to  the  error  estimation  of  the  finite- difference  scheme,  used 
to  solve  the  PWE,  on  a  plane  wave  functional  basis.  For  the  Crank-Nicolson  scheme  (14)  we  obtain 

=  u{nT^mh)  =  (22) 


where 
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This  solution  correctly  describes  the  plane  wave  (19)  in  the  ’’parabolic”  approximation  if  the  fol 
lowing  conditions  are  met: 
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(24) 


They  allow  one  to  choose  the  finite-difference  grid  steps  r  and  h  in  order  to  assure  the  required 
accuracy. 

The  problem  encountered  in  handling  practical  examples  consists  in  evaluating  the  spectrum 
limits  of  the  angles  =  sin~^p  contributing  into  the  wave  field  in  the  domain  of  interest. 

For  example,  the  field  in  the  focal  region  is  determined  by  an  integral  of  the  elementary  plane 
waves  over  the  whole  aperture  of  the  optical  element  (lens  or  zone  plate).  In  this  case,  the  maximum 
angle  of  diffraction  can  be  estimated  as  ^^ax  ^  R/f  where  R  is  the  element  radius  and  /  is  the 
focal  length. 


4.ComputerCode 

The  finite-defference  algorithm  of  solving  the  PWE  described  above  allows  for  fast  calculation 
of  the  global  wave  field  distribution  even  on  a  personal  computer.  The  diffraction  field  patterns 
visualized  by  means  of  color  graphics  contains  a  large  amount  of  information  useful  for  computer 
aided  design  of  complex  optical  systems  as  well  as  for  education  and  training  in  wave  theory. 

We  demonstrate  a  prototype  of  the  computer  code  for  modeling  a  wide  class  of  quasioptical 
systems  composed  of  lenses,  Fresnel  zone  plates,  dielectric  waveguides,  etc. 

The  main  features  of  this  program  are: 

•  interactive  design  and  display  of  the  optical  scheme  using  a  built-in  library  of  elements  (user 
has  to  choose  the  needed  ones  and  to  define  only  their  position  and  parameters)  ; 

•  a  priori  accuracy  estimations  of  the  PWE  and  finite-difference  scheme  realized  in  the  form  of 
warnings  and  cues  ; 

•  step-by-step  numerical  solution  of  the  PWE  in  a  truncated  domain  with  transparent  bound¬ 
aries; 

•  interactive  calculation  regime  with  breakpoints,  held  storage  and  complex  amplitude  plotting 
in  desired  cross-section  ; 

•  global  field  visualization  using  ajustable  color  intensity  palette. 
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1  Introduction 

Theoretical  dosimetr}'  defines  the  link  between  external  fields  and/or  source  configurations  which  can  be  measured 
to  dose  quantities  in  the  body  which  are  difficult  to  measure.  These  dose  quantities  such  as  the  specific  energy  absorption 
rate,  SAR  and  current  density  are  related  to  a  possible  insult  or  hazard.  The  key  to  future  progress  in  dosimetry  is  the 
development  of  anatomically  correct  voxel  (volume  pixel)  phantoms  from  medical  imaging  data. 

This  paper  presents  examples  of  Finite-Difference  Time-Domain  (FDTD)  calculations  of  SAR  in  an  anatomically 
realistic  model  of  the  head  at  frequencies  between  9(X)  MHz  and  2.45  GHz  for  a  plane  wave  source  and  for  a  monopole 
representing  a  generic  mobile  communications  transceiver. 

2  Model  of  the  Head 

A  fine  resolution,  anatomically  detailed  model  of  the  head  has  been  constructed  directly  from  a  sagittal  MRI  scan 
of  a  single  male  subject.  The  MRI  data  were  segmented  into  4  million,  1  mm  voxels  in  the  shape  of  cubes.  Each  cube  has 
a  lag  which  identifies  its  tissue  type.  Segmentation  which  is  the  identification  of  the  tissue  types  was  performed  using  the 
volume  rendering  code  Voxel  View  (1993)  and  in  particular  the  merging,  arithmetic  manipulation  and  grease  pencil  features; 
the  latter  allows  the  free-hand  insertion  and  erasure  of  voxels.  The  head  was  split  up  into  convenient  segments.  Each 
segment  was  then  divided  into  high  and  low  greyscale  tissues.  Each  classification  was  then  further  differentiated  by 
successive  editing  and  subtraction.  Any  anomalies  were  then  tidied  up  with  the  grease  pencil. 

The  tissues  were  skin,  muscle,  bone,  brain,  cerebrospinal  fluid  and  air  (sinuses).  The  original  eye  was  of  poor  clarity 
and  so  has  been  replaced  by  an  analytical  eye  of  concentric  spheres.  The  tissues  in  the  eye  are  the  lens,  humour  and  a 
composite  sclera-humour  layer.  The  hypothalamus,  pituitary  and  pineal  glands  were  also  edited  in  by  hand,  their  positions 
determined  from  an  anatomical  atlas  of  the  head.  The  FDTD  calculations  were  not  tractable  at  1  mm  resolution  so  the  model 
was  rescaled  to  2  mm  resulting  in  490,000  voxels  in  the  head. 

Figure  1  shows  an  annotated  axial  slice  through  the  eyes  of  the  2  nun  phantom. 

3  Numerical  Method 

The  Finite-Difference  Time-Domain  (FDTD)  method  has  been  well  documented  (see  e.g.  Taflove  and  Umashankar, 
1990;  Kunz  and  Luebbers,  1993)  and  only  a  brief  description  is  given  here.  The  computational  domain  is  divided  into  a  3D 
lattice  of  cells  which  are  assigned  individual  electrical  properties.  For  a  cubic-cell  space  lattice,  the  explicit  finite-difference 
procedure,  involves  positioning  the  components  of  the  electric  field,  £,  on  the  middle  of  the  edges  and  the  components  of 
the  magnetic  field,  H,  at  the  centre  of  the  faces  on  a  unit  cell  of  the  lattice.  The  calculation  then  leap-frogs,  evaluating  E 
and  H  at  alternate  half-time  steps.  Time-stepping  proceeds  until  equilibrium  has  been  reached.  A  domain  enclosing  the  target 
and  a  boundary  condition  on  the  surface  of  the  domain  must  be  chosen  to  absorb  the  outgoing  scattered  waves  (Mur,  1981). 
A  Faraday’s  law  contour  path  was  used  to  derive  the  circumferential  H  fields  immediately  adjacent  to  the  wire  (Taflove  et  al 
1988)  in  the  modelling  of  the  monopole.  The  monopole  was  excited  by  driving  a  voltage  across  the  one  cell  gap  between 
the  wire  antenna  and  the  top  surface  of  the  metal  box.  The  conducting  metal  box  was  treated  by  setting  the  electric  fields, 
tangential  to  the  surfaces,  to  zero.  Further  details  of  the  computational  method  and  tissue  electrical  properties  can  be  found 
in  Dimbylow  and  Gandhi  (1991)  and  Dimbylow  (1993). 

4  Results 

The  specific  absorption  rate,  SAR  is  defined  as 


SAR  = 
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where  a  is  the  electrical  conductivity,  p  the  density  and  E  is  the  root-mean  square  (rms)  electric  field. 

Figure  3  shows  a  histogram  of  the  SAR  across  the  section  through  the  eyes,  depicted  in  figure  1,  for  anteroposterior, 
AP  irradiation  of  the  adult  at  900  MHz.  The  front  of  the  head  faces  the  top  righthand  comer.  Energy  is  preferentially 
absorbed  in  the  high  conductivity  tissues  such  as  the  eye,  brain,  muscle  and  skin.  The  maximum  absorption  occurs  in  the 
eyes.  Bone  has  a  low  conductivity  resulting  in  lower  power  deposition  in  the  skull.  The  absorption  in  the  brain  reveals  a 
complex  standing  wave  pattern.  Figure  4  illustrates  a  surface  wave  or  whispering  gallery  effect.  The  head  is  being  irradiated 
from  the  side  at  2.45  GHz.  At  this  frequency  the  penetration  into  the  head  is  small  and  so  the  energy  absorption  should  be 
superficial  at  the  side  of  the  head.  However  there  is  significant  absorption  in  the  opposite  ear  from  a  wave  propagating 
around  the  surface  of  the  head  rather  than  through  the  volume  of  the  head. 

The  interaction  between  a  mobile  communication  transceiver  and  the  head  was  modelled  using  a  V4  monopole  on 
a  metal  box.  Three  source  geometries  of  the  transceiver  were  considered,  viz  -  vertical  alignment  in  front  of  the  eye  denoted 
APy.  vertical  alignment  at  the  side  of  the  head  by  the  ear  denoted  LAT^  and  horizontal  alignment  at  the  side  of  the  head 
LAT^  (AP  is  short  for  antero-posterior  and  LAT  for  lateral).  The  AP^  geometry  was  chosen  to  simulate  a  person  talking  into 
the  transceiver  and  more  importantly  because  the  feed  point  of  the  antenna,  near  the  surface  of  the  eye,  will  produce  the 
highest  local  SAR.  The  LAT  geometries  represent  other  typical  positions  in  which  the  transceiver  could  be  held.  Figure  2 
shows  a  volume  reconstruction  of  the  2  mm  voxel  head  and  the  transceiver  for  the  LATy  geometry  and  gives  a  good 
impression  of  the  degree  of  realism  that  can  be  achieved  by  models  based  on  medical  imaging  data.  Also  shown  is  the 
placement  of  a  simulated  hand  grasping  the  lower  part  of  the  box. 

Figure  5  shows  the  SAR  across  the  axial  slice  through  the  eyes  for  APy  irradiation  at  900  MHz  with  the  centre  of 
the  box  2cm  away  from  the  surface  of  the  eye.  The  energy  absorption  is  far  more  superficial  than  in  the  plane  wave  case 
of  figure  3  and  most  of  the  energy  is  absorbed  in  the  left  eye.  Figure  6  shows  the  SAR  distribution  for  LATj^  irradiation  with 
the  antenna  2  cm  from  the  surface  of  the  ear.  The  energy  absorption  is  principally  in  the  ear  with  little  penetration  into  the 
brain  and  eye. 

A  complete  description  of  the  head-transceiver  calculations  and  a  discussion  of  the  implications  to  safety  standards 
is  given  in  Dimbylow  and  Mann  (1994), 

5  Prospectus 

A  fine -scaled  2  mm  resolution  voxel  model  of  the  human  male  body  is  being  constructed  at  NRPB.  The  raw  MRI 
data  were  taken  from  a  series  of  continuous  scans  of  the  various  parts  of  the  body  of  a  single  subject.  These  blocks  of  data 
were  then  fixed  together  by  rescaling,  translocation  and  rotation.  The  segmentation  of  the  volume  so  that  each  voxel  is 
ambiguously  assigned  to  a  tissue  type  is  in  progress.  The  completion  of  this  model  will  provide  a  detailed  anatomically 
realistic  phantom  for  comprehensive  dosimetry  of  electromagnetic  fields. 
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FIGURE  1  Axial  slice  through  the  eyes  of  the  2  mm  phantom 


FIGURE  2  Volume  reconstruction  of  the  voxel  head  and  the  transceiver 
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I.  Introduction 

The  NEC  -  Basic  Scattering  Code  (NEC-BSC)  [1]  has  been  used  for  over  20  years  for  the 
analysis  of  high  frequency  antenna  performance  on  complicated  structures  such  as  ships, 
aircraft,  and  missiles.  The  user  interface  to  the  program  consists  of  a  input  file  which 
contains  commands  and  data.  This  classical  mechanism  is  flexible,  transportable  and 
efficient.  In  contrast,  modern  graphical  user  interfaces  provide  very  high  levels  of  user 
interaction  at  the  expense  of  transportability.  The  optimum  solution  is  to  retain  the 
classical  interface  in  the  NEC-BSC  itself,  and  create  a  second,  companion  program  which 
provides  a  graphical  user  interface  to  assist  the  user  in  creating  the  classical  input 
command  file.  The  almost  universal  acceptance  of  Microsoft  Windows  makes  it  the 
natural  choice  for  implementing  a  graphical  user  interface.  This  bifurcation  of  the  codes 
into  a  graphical  user  interface  code  for  generating  the  input  file,  and  an  analysis  code  for 
generating  the  numerical  results  allows  the  analysis  code  to  remain  transportable  to  any 
computer  that  has  a  FORTRAN  compiler.  The  user  can  generate  the  input  file  on  a  PC 
and  then  run  the  NEC-BSC  on  either  the  PC  or  a  more  powerful  computer.  If  the  BSC  is 
run  on  a  different  computer,  then  the  results  can  be  brought  back  to  the  PC  for  viewing, 
plotting,  etc. 

II.  Design  Philosophy 

The  NEC-BSC  Workbench™  is  a  windows  based  environment  for  creating  and 
manipulating  input  files  for  NEC-BSC.  The  user  can  open  an  existing  file  containing 
NEC-BSC  commands  or  create  new  files.  The  input  commands  are  displayed  in  a 
conventional  text  editing  window,  and  the  actual  geometry  is  displayed  in  separate 
wireframe  views.  The  design  philosophy  of  the  Workbench  is  based  on  two  fundamental 
principles.  First,  the  Workbench  is  intended  to  be  an  advanced  command  text  editor  with 
drawing  and  visualization  capabilities  where  the  user  can  create  input  files,  run  NEC-BSC, 
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and  view  the  results  without  ever  leaving  the  Workbench.  It  is  not  intended  to  be  a  CAD 
program.  There  are  many  excellent  CAD  programs  on  the  market  and  there  is  little  to  be 
gained  by  creating  another.  Rather  than  spend  time  and  effort  creating  a  new  CAD 
program  which  merely  duplicates  the  features  of  existing  programs,  the  Workbench  will 
provide  import  and  export  capabilities  to  existing  CAD  programs. 

The  second  design  principle  is  a  requirement  that  the  Workbench  run  on  a  modestly 
equipped  PC.  The  Workbench  does  not  require  massive  amounts  of  memory,  special 
graphics  hardware,  or  the  latest  32  bit  operating  systems.  The  current  version  runs  well 
on  a  486/33  with  Windows  3.1,  4  megabytes  of  memory,  and  standard  VGA  video.  In 
order  to  accomplish  this,  it  was  necessary  to  use  simple  wireframe  drawing,  without 
hidden  lines,  rather  than  more  advanced  graphics  employing  hidden  lines  or  solid  surface 
modeling,  as  shown  in  Figure  1 ,  The  simplicity  of  the  drawing  is  offset  by  the  speed  with 
which  the  drawing  can  be  generated.  For  most  geometries,  the  wireframe  drawing  can  be 
manipulated  in  a  dynamic,  real-time  mode.  Future  versions  of  the  Workbench  will  provide 
solid  surface  model  view  by  using  separate,  non-dynamic  windows.  This  technique  is  not 
new  and  has  been  used  by  CAD  programs  such  as  Autocad  for  many  years.  In  Autocad, 
the  user  performs  all  editing  functions  using  wireframe  view  without  hidden  lines,  and  then 
generates  a  solid  surface  model  only  when  the  drawing  is  complete.  Future  versions  of  the 
Workbench  will  gain  solid  surface  model  capabilities  through  Open-GL.  Open-GL  is 
currently  available  on  Windows  NT  3.5  and  is  expected  to  be  available  in  an  early  release 
of  Windows  95.  Benchmarks  already  performed  using  Open-GL  on  NT  3,5  are  very 
encouraging.  Even  without  special  graphics  hardware,  the  solid  model  performance  is 
surprisingly  good, 

III.  Geometry  Viewing  and  Editing 

Any  number  of  wireframe  views  can  be  created,  allowing  simultaneous  viewing  of  the 
geometry  from  several  angles  as  shown  in  Figure  2.  As  the  user  makes  changes  to  the 
input  file,  these  changes  are  reflected  in  the  wireframe  views.  The  user  can  view  not  only 
the  physical  structure,  but  also  sources,  receivers,  paths  of  sources  and  receivers  in  the 
near  field,  and  the  reference  coordinate  system.  Viewing  options  include  pan,  zoom, 
viewing  angle,  parallel  projections  and  perspective  projections.  By  selecting  a  specific 
command  in  the  input  file,  the  user  can  load  the  entire  command  into  a  dialog  box  and  thus 
obtain  very  high  levels  of  user  interaction.  The  user  can  also  accomplish  dialog  box 
editing  of  commands  by  picking  a  geometry  item  from  one  of  the  wireframe  views  and 
double  clicking  on  that  item.  A  typical  editing  dialog  and  its  associated  command  text  is 
shown  in  Figure  3. 

IV.  Ray  Viewing  and  Editing 

In  a  new  version  of  the  NEC-BSC,  an  option  to  output  the  ray  paths  and  field  information 
is  implemented.  The  NEC-BSC  Workbench  can  read  these  files  and  display  the  ray  paths 
superimposed  on  the  wireframe  views  of  the  geometry.  For  complicated  problems,  where 
the  number  of  rays  is  large,  the  Workbench  provides  various  filters  for  selecting  which 
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rays  will  be  drawn.  Rays  can  be  filtered  based  on  the  number  of  interactions  or  by  relative 
power.  The  Workbench  also  supports  OLE2  drag  and  drop  for  moving  rays  from  one 
wireframe  view  to  another,  and  thus  allows  the  user  to  edit  the  ray  representations  in  any 
fashion.  The  user  can  obtain  information  about  a  particular  ray  by  selecting  and  double 
clicking  on  that  ray.  Figure  4  shows  a  typical  representation  of  a  corner  reflector  with  a 
near  field  source  and  field  point.  The  different  windows  show  rays  that  have  interacted 
with  the  comer  reflector  from  0  to  3  times. 

V.  Other  Capabilities 

Finally,  the  Workbench  allows  the  user  to  view  and  print  large  output  files  produced  by 
NEC-BSC.  Future  versions  will  include  plotting  capabilities.  At  present,  the  ASCII 
output  files  produces  by  NEC-BSC  can  be  read  by  most  standard  spreadsheet  type 
plotting  programs.  The  Workbench  was  designed  to  work  closely  with  the  NEC-BSC, 
almost  as  if  it  was  part  of  the  BSC  rather  than  a  separate  program.  Using  a  button  on  the 
toolbar,  the  user  can  launch  a  special  windows  version  of  NEC-BSC.  This  will  cause  the 
current  input  file  to  be  automatically  saved  and  submitted  to  the  BSC  for  analysis.  Since 
the  BSC  runs  as  a  separate  program,  the  user  can  continue  to  edit  and  view  other  input 
files  while  the  BSC  is  executing. 

VI.  References 
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Figure  3.  Example  of  a  command  dialog.  This  example  shows  the  dialog  box  which 
encapsulates  the  source  command.  The  actual  command  file  text  of  the  is  included  for 
reference.  The  dialog  box  can  either  be  invoked  by  double  clicking  on  the  line  of  text 
containing  the  SG;  command  or  by  double  clicking  on  the  source  in  a  wireframe  view. 
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Figure  4.  Comer  reflector  showing  rays  from  source  to  near  field  point.  Upper  left 
window  shows  all  rays.  Other  windows  show  rays  that  1,  2,  and  3  times  respectively. 
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ABSTRACT.  A  comprehensive  antenna  and  propagation  modeling  code  for  the  Medium  Frequency 
(MF)  band  has  recently  been  developed.  The  code  was  developed  under  Windows'''^  3.1  on  a  80486 
PC  architecture  to  provide  arrayed  antenna  analyses,  and  use  the  results  to  perform  ground-wave  and 
skywave  propagation  analyses.  A  sophisticated  user-friendly  graphical  interface  allows  quick  and 
easy  specification  of  inputs  and  displays  the  results  in  easy  to  understand  formats.  Typical  outputs 
include  far  field  radiation  plots  and  color-coded  contour  plots  that  can  be  easily  inserted  into 
documents.  Extensive  on-line  help  system  and  error  checking  eliminate  the  most  common  errors, 
such  as,  invalid  users  input. 


INTRODUCTION. 

Numerical  simulation  of  wireless  communication  and  broadcast  systems  is  an  art,  left  to  engineers 
and  scientists  with  many  years  of  experience.  The  Communication  Engineering  Laboratory  (CEL) 
simplifies  user  access  to  this  art  with  modern  graphical  user-interface  (GUI)  tools.  Three 
electromagnetic  antenna  modeling  (EAM)  codes  for  quick  and  intensive  antenna  analysis  were 
introduced  a  few  years  ago.  Next,  lONWIN,  a  GUI  for  lONCAP  [1]  was  developed  to  simplify  the 
simulation  of  high  frequency  skywave  propagation.  This  past  year,  CEL  developed  an  all  inclusive 
code  to  simulated  MF  communication  and  broadcast  predictions  called  MF  Works. 

The  MF  Works  code  provides  a  comprehensive  system  for  modeling  medium  frequency  antennas, 
then  uses  the  results  for  input  to  ground-wave  propagation  analysis.  To  complete  the  toolset,  skywave 
analysis  was  also  implemented.  This,  too,  uses  the  antenna  results  as  an  input.  Each  of  these 
computational  modules  has  a  custom  designed  GUI  for  input  data  definition  and  visualization  of  the 
resultant  output. 

Key  features  of  MF  Works  are  it's  map  windows.  During  input  definition,  the  map  window  assists 
in  specifying  the  geographical  coordinates  of  the  nodes,  and  during  output,  contour  levels  are 
superimposed  on  the  map.  The  map  window  is  based  on  the  Defense  Mapping  Agency’s  (DMA)  high 
resolution  map  database  and  can  be  used  to  display  any  location  on  the  planet.  Three  different 
resolutions  of  the  DMA  map  database  are  used  to  decrease  disk  access  time.  To  simplify  operation, 
the  map  window  contains  a  Tool  Bar,  Status  Bar,  and  Scroll  Bars.  Output  data  is  displayed  as  colored 
contours  superimposed  on  maps  for  the  ground-wave  and  skywave  analyses. 
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THEORETICAL  BACKGROUND. 


Magnitude  and  phase  of  applied  current  to  a  directional  antenna  system  controls  the  directive 
pattern’s  lobes  and  nulls.  In  the  broadcast  world,  poorly  directed  lobes  or  high  nulls  can  cause 
interference  with  established  service  areas  of  other  stations  or  waste  energy  over  unpopulated  areas. 
Once  the  energy  is  radiated  from  the  directional  antenna  system,  the  distance  it  travels  while 
maintaining  a  useful  signal  level  is  very  important.  Energy  is  radiated  into  a  space-wave  and  surface- 
wave  along  the  earth/air  boundary.  The  space-wave  propagates  in  a  spherical  wavefront  attenuating  at 
1/r^,  whereas  the  surface-wave  propagates  as  a  cylindrical  wavefront  and  therefore  attenuates  at  1/r.  If 
we  just  consider  wavefront  expansion  attenuation,  then  the  surface-wave  should  always  outperform 
the  space-wave  with  terrestrial  communication  and  broadcast.  The  surface-wave  has  an  additional 
loss  factor  caused  by  currents  heating  the  boundary’s  media  or  the  ground’s  constitutive  parameters. 
High  conductivity  do  not  impede  current  flow  and  allow  a  surface  wave  to  propagate  thousands  of 
kilometers,  on  the  other  hand,  low  conductivity  can  limit  propagation  to  tens  of  kilometers. 

The  standard  terms  used  in  communication  theory  are  skywave  and  ground-wave.  Skywave  refers 
to  space-waves  that  are  refracted  by  the  Ionospheric  and  ground-wave  is  composed  of  the  space-  and 
surface-waves,  where  the  space-wave  components  include  the  direct  (line-of-sight),  ground-reflected, 
ridge-diffracted,  tropospheric-refracted, 

tropospheric-scattered,  and  ducted  modes,  as 
shown  in  Figure  1.  The  surface-wave  is 
composed  of  propagating  electromagnetic  waves 
bound  to  the  earth’s  surface  and  refracted  in  the 
lower  atmosphere.  The  dominant  propagation 
modes  that  comprise  any  ground-wave  link  are 
determined  by  the  signal  frequency,  antenna 
patterns,  and  the  propagation  environment, 
including  terrain  and  climate.  In  the  MF  band, 
the  dominate  ground- wave  term  is  the  surface- 
wave  bound  to  the  earth/air  interface.  The 
propagation  environment  still  determines  the 
distance  a  radiowave  can  propagate  while 
remaining  above  the  noise  floor. 

Assuming  a  fixed  frequency,  then  conductivity  of  the  interface  along  the  propagating  path  is 
primarily  responsible  for  propagation  distance.  Conductivity  typically  varies  from  10  S/m  for  dry 
sandy  desert,  to  5  S/m  for  salt  water.  Low  conductivity  limits  the  current  flow  in  the  ground  required 
by  the  vertically-polarized  electric-field  component,  the  dominant  polarization  component  of  surface- 
wave  propagation  [2].  In  desert  climates,  these  effects  are  experienced  as  a  significant  reduction  in 
effective  ground-wave  range  to  just  beyond  the  radio  Line-Of-Sight  (LOS)  [3].  Typically,  MF 
propagation  will  extend  over  varying  constitutive  parameters.  Therefore,  accurate  ground-wave 
analyses  must  be  capable  of  analyzing  a  combined  path  of  multiple  constitutive  parameters.  MF 
Work’s  ground-wave  analysis  performs  predictions  over  both  constant  homogenous  ground  and  a 
combined  path  with  up  to  four  ground  constitutive  parameters  changes  on  each  radial  path. 
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Figure  1.  Surface-Wave  and  Space-Wave  Components 
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ANTENNA  ANALYSIS. 


Typically,  a  resonance  quarter- wave  monopole  at  one  megahertz  is  75  meters  high  with  a  good 
ground  screen  and  no  loading.  Multiple  monopoles  of  this  size  are  commonly  arrayed  to  shape  the 
desired  coverage  pattern.  The  MF  Works  antenna  analysis  supports  arrays  of  monopole  antenna  on 
good  ground  screens.  Array 
data  are  specified  using  a 
customized  input  dialog  window 
with  all  parameters  referenced 
to  the  first  antenna  as  shown  in 
Figure  2.  A  minimum  of  one 
antenna  is  required  and  would 
result  in  an  omni-azimuthal 
coverage  pattern.  A  maximum 
of  five  antennas  can  be  arrayed 
to  create  a  complex  directional 
coverage  pattern. 

Once  an  antenna  analysis  has 
been  performed,  its  results  are 
displayed  in  a  polar  radiation 
plot,  as  shown  in  Figure  3.  The 
plot  displays  azimuth  cuts  of  the 
antenna's  three-dimensional 
power  gain  pattern  using  a  user- 

defined  elevation  angle.  The  output  data  are  plotted 
versus  compass  angles,  the  top  of  the  plot  is  North  (0°), 
the  right  is  East  (90°),  the  bottom  is  South  (180°),  and 
the  left  is  West  (270°).  This  is  contrary  to  the  customary 
right  hand  rule  used  in  antenna  theory,  but  provides  a 
more  intuitive  visual  understanding  of  the  antenna’s 
pattern  relative  to  the  maps  used  in  ground-wave  and 
sky  wave  analyses. 

GROUND-WAVE  ANALYSIS. 

As  stated  earlier,  the  ground-wave  in  the  MF  band  is 
predominately  a  surface-wave  component  bound  to  the 
earth/air  interface.  The  following  analysis  only  considers 
that  propagating  surface-wave  component.  Input 
parameters  are  easily  specified  using  a  customized  dialog 
window  as  shown  in  Figure  4. 
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The  antenna 
filename  specifies  a 
previously  run  MF 
Work’s  antenna 
analysis  file  located 
at  the  geographic 
coordinates  entered 
in  the  location 
entry.  These 

geographic 
coordinates  can  be 
entered  by  typing  in 
the  longitude  and 
latitude  or  by  using 
the  map  window. 

The  boresight 
bearing  angle, 
specified  in  degrees 
East  of  North, 
allows  specifying 
compass  direction 

bearing  in  which  the  antenna  is  pointed.  Frequency  is  entered  in  units  of  MHz  and  must  be  between 
0.3  and  3.0  MHz.  Power  is  entered  in  units  of  kilowatts  (kW)  and  must  be  between  0  and  10,000. 

The  receive  grid  dimensions  are  used  to  specify  a  rectangular  area  of  receive  locations  over  which 
predictions  are  to  be  performed.  Width,  height,  and  resolution,  all  in  units  of  geographical  degrees  are 
required.  MF  Works  automatically  executes  ground-wave  analyses  for  each  resolution  grid  point.  If 
the  expected  ground-wave  propagation  distance  with  useable  signal  power  in  uncertain  then  over 
specify  the  receiver  area  and  use  a  course  resolution,  this  will  speed  up  the  analysis  time.  Then, 
respecify  the  area  of  interest  and  use  a  finer  resolution  for  smoother  output  contours. 

Ground-wave  propagation  distances  are  highly  dependent  on  the  ground’s  constitutive  parameters. 
For  homogeneous  ground  there  are  two  fields  to  specify:  conductivity  (mS/m)  [o]  and  relative 
permittivity  (unitless)  [er].  Conductivity  typically  varies  from  10  S/m  for  dry  sandy  soil  to  10  for 
average  soil  to  5  S/m  for  sea  water.  Relative  permittivity  typically  varies  from  1  for  air  to  3  for  dry 
sand  to  15  for  average  soil  to  81  for  seawater  [4]. 

MF  Works  also  performs  ground-wave  analysis  involving  varying  ground  constitutive  parameters. 
The  path  analysis  of  successively  changing  constitutive  parameters  requires  specifying  radials  paths 
anchored  at  the  transmitter  location.  The  amount  of  input  data  requires  specification  using  an  external 
data  file,  as  shown  in  Figure  5.  Each  radial  path  has  a  maximum  of  4  different  constitutive 
parameters.  For  example,  at  a  30°  angle,  the  combined  path  has  two  ranges;  the  first  120  km  from  the 
antenna  is  land  and  then  changes  to  sea  water.  At  40°,  land  extends  for  100  km,  then  changes  to  salt 
marsh  out  to  160  km,  then  to  sea  water.  The  last  radial  defined  is  automatically  repeated  at  the  same 
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Figure  4.  Dialog  window  for  ground-wave  input  data. 
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degree  increment  to  completely  define  360  degrees,  therefore  land  exists  in  all  other  directions  from 
80°  to  360°,  and  from  0°  to  20°. 
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Figure  5.  External  text  file  required  for  changing  constitutive  parameters  over  radial  ground  paths. 


The  ground-wave  signal 
strength  attenuation  is 
calculated  by  a  series  of 
summation  equations  as  a 
function  of  frequency, 
conductivity,  and  relative 
permittivity.  As  shown  in 
Figures  6a,b,c,  this 
formulation  agrees  well  with 
FCCGW  [5]  (Bremmer’s 
extension  to  Sommerfelds 
models  [6,7]).  The  only 
exception  is  past  200  km  over 
salt  water.  MF  Work’s 
computation  seems  to  be  on 
the  conservative  side. 

In  the  past,  text-based 
computer  codes  only  provided 
the  numerical  value  for 
attenuation  versus  distance. 
The  customary  initial  electric 
field  strength  is  set  to  a  fixed 
value  at  i  mile.  This  is 
electric  field  delivered  from  a 
short  lossless  vertical  radiator 
(SLVR)  radiating  energy 
equally  in  ail  directions  along 
the  boundary.  It  was  left  to 
the  engineer  to  add  or  subtract 
antenna  performance  after 
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ground-wave  computations 
were  completed.  In  reality, 
a  sophisticated  set  of 
antennas  are  arrayed  to 
provide  the  ideal  coverage 
pattern  which  is  far  from 
omni-directional.  MF 
Works  uses  the  computed 
electric  field  value  from  the 
earlier  antenna  analysis, 
specified  by  the  antenna 
filename,  to  provide  the 
initial  electric  field  values 
along  each  radial  at  1  mile. 
Linking  antenna  analysis 
results  with  the  ground- 
wave  analysis 
alleviates  a  lot  of 
tedious  work  for 
the  engineer.  It 
also  results  in  an 
immense  amount 
of  output  data. 

This  data  can  be 
easily  viewed  as 
color  contours 
superimposed  on 
to  a  geographic 
map,  as  shown  in 
Figure  7.  The 
map  window  has 
several  options  to 
better  view  the 
results,  zoom  in 
and  out,  add  a 
title,  increase  or 
decrease  the 

number  of 

contours,  and 
modify  the 

contour  values. 
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Figure  6c.  MF  Works  and  FCCGW  over  sr=81,  ct=4.64  S/m. 
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SKYWAVE  ANALYSIS. 


Energy  radiated  from  an  antenna  launches  a  ground-wave  along  the  earth/air  boundary  and  also 
launches  a  space  wave  component.  Electrons  in  the  ionosphere  cause  frequency  selective  bending  of 
electric  and  magnetic  fields  as  they  propagate  through  different  electron  densities.  The  selective 
bending  causes  these  propagating  fields  to  return  to  earth  at  varying  distances  from  the  transmitting 
antenna.  The  MF 
Works  skywave 

analysis  computes 
these  returned  electric 
field  magnitudes 

versus  distance  and 
displays  them  as 
colored  contours 

superimposed  on  a 
map. 

Skywave 

parameters  are 

specified  using  a 
customized  dialog 
window  similar  to  the 
ground-wave  dialog. 

Outputs  are  displayed 
as  colored  signal  level 
contour  maps,  as 
shown  in  Figure  8. 


SUMMARY. 

A  program  now  exists  to  link  antenna  analysis  with  ground-wave  and  skywave  analysis  from  within  a 
sophisticated  graphical  user-interface.  A  user  with  only  a  basic  understanding  of  PCs  and  Windows™ 
will  be  able  to  use  the  GUI  to  predict  the  performance  of  a  medium  frequency  communication  or 
broadcast  system.  The  GUI  allows  quick  and  easy  specification  of  inputs  and  displays  the  results  in 
easy  to  understand  formats.  Outputs  includes  far  field  radiation  pattern  plots  and  color-coded  contour 
plots  that  can  be  easily  inserted  into  documents.  Extensive  on-line  help  system  and  error  checking 
eliminate  the  most  common  errors,  such  as,  invalid  users  input. 

CEL  continues  to  develop  numerical  tools  to  alleviate  the  tedious  task  of  linking  antenna  modeling 
data  with  propagation  analysis.  Several  algorithms  which  modify  the  Numerical  Electromagnetic 
Code’s  (NEC)  output  files  to  a  compatible  input  format  for  lONCAP,  ICEPAK,  and  VOACAP  have 
been  developed.  CEL  has  developed  five  other  numerical  tool’s  for  the  analysis  of  antennas  and 
propagation.  An  overview  of  these  other  tools  can  be  found  in  reference  [8]. 
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All  codes  were  designed  for  operation  on  an  IBM-compatible  personal  computer  (PC).  Minimum 
system  requirements  include  an  80386-based  PC,  MS-DOS  version  5.0,  and  Windows™  version  3.1  or 
later  operating  in  the  enhanced  mode.  The  minimum  peripheral  hardware  requirements  include  a 
mouse,  keyboard,  four-Megabytes  random  access  memory,  five-Megabytes  hard-disk  storage  capacity, 
and  VGA  graphics  capability.  Optional  SAIC-modified  DMA  map  database  (geopolitical  boundaries 
only)  requires  115  Megabytes  of  hard-disk  storage  space.  All  modules  include  help  systems,  error 
checking,  printing,  plus  all  Window's  features.  MF  Works  is  written  in  Borland  C++  for  Windows, 
version  4.0. 
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Abstract 

A  user-friendly  graphical  interface  code  is  developed  for  the  study  of  electromagnetic  interaction  with 
complex  geometries  using  the  Finite  Difference  Time  Domain  method.  This  code  simplifies  the 
generation  of  complex  scattering  geometries  through  the  use  of  basic  shapes  such  as  spheres,  cubes, 
ellipsoids,  finite  cylinders,  and  finite  strips.  The  visualization  of  a  source  plane  wave  represented  as  a 
Gaussian  pulse  is  also  important  to  determine  the  valid  frequency  range.  The  simulation  is  visualized 
through  the  selection  of  one  of  the  field  components  in  a  plane  cut  specified  by  the  user. 

Introduction 

The  availability  of  a  more  powerful  generation  of  personal  computers  at  low-end  workstation  prices 
makes  higher  level  3D  electromagnetic  codes  possible  at  lower  costs.  The  study  of  the  interaction  of 
electromagnetic  waves  with  complex  scatterers  is  a  difficult  task  that  involves  computational  methods 
such  as  the  Finite  Difference  Time  Domain  (FDTD)  method.  Visualization  tools  previously  exclusive 
to  high-end  graphics  workstations  such  as  the  FDTD  visualization  tool  for  the  SGI  [1]  can  be  used  on 
current  486  and  Pentium  based  PC's  equipped  with  high  speed  graphics  cards.  However,  building  up 
the  cells  into  the  FDTD  simulation  space  is  a  difficult  process  for  practical  problems.  Therefore,  a 
user-friendly  graphical  interface  is  developed  and  briefly  described  here  for  the  analysis  of  an 
unlimited  number  of  electromagnetic  problems  through  interactive  real  time  visualization  of  the  field 
components  using  FDTD.  The  FDTD  code  is  adapted  from  Kunz  and  Luebbers  [2].  The  source 
excitation  currently  supported  is  a  Gaussian  pulse  plane  wave. 

Program  Startup 

Startup  of  the  program  shows  three  windows  as  shown  in  Fig.  1 .  The  top  window  is  used  to  design  the 
geometry  and  the  bottom  left  window  shows  a  parallel  perspective  projection  of  the  3D  geometry.  The 
bottom  right  window  is  used  to  show  an  animated  plane  cut  of  one  of  the  six  cartesian  field 
components. 

Geometry  Construction 

Before  defining  the  scattering  geometiy^  the  user  must  define  the  FDTD  space  that  contains  the 
geometry.  Selection  of  the  "New"  option  under  the  "File"  menu  will  open  the  space  parameters 
window  as  shown  in  Fig.  2. 
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Once  the  space  is  defined,  the  user  may  proceed  to  add 
objects.  The  basic  shapes  available  are  spheres,  cubes, 
ellipsoids,  finite  cylinders,  or  finite  strips.  These  shapes  are 
added  by  selection  of  the  corresponding  name  in  the 
'Generate'  menu.  A  placement  window  is  opened  for  the 
selected  shape.  The  object  location  and  dimensions  are  then 
defined,  along  with  an  optional  name  for  the  particular 
object.  As  an  example,  Fig.  3  shows  the  sphere  placement 
window  in  which  the  sphere  center  location  and  radius  are 
to  be  defined. 

The  cube  placement  requires  the  offset  location  and  x,  y,  z 
dimensions  while  the  ellipsoid  placement  requires  the 
ellipsoid  center  and  major  radii  parallel  to  the  x,  y,  and  z 
axes.  All  offsets  must  be  given  from  the  global  origin 
(0,0,0).  The  cylinder  placement  window  that  is  presented  in 
Fig.  4  is  more  complex.  Notice  that  the  cylinder  is  defined 
by  the  length  of  its  axis,  the  radius  of  the  cylinder,  the  offset 
of  the  starting  point  of  the  its  axis,  and  axis  orientation.  The 
strip  placement  window  as  shown  in  Fig.  5  may  appear  quite 
complex.  How^ever,  note  that  the  strip  should  be  in  one  of 
the  xz,  yz,  or  xy  planes.  Once  the  plane  of  the  strip  is 


selected,  the  direction  of  the  length  related  to  the 
selected  plane  should  be  chosen.  For  example,  a  strip 
in  the  xy  plane  could  have  its  length  in  the  x  or  y 
direction. 

Once  the  objects  are  all  placed,  the  parallel  perspective 
view  wdndow  will  show  a  3D  representation  of  a 
constructed  model,  as  demonstrated  by  the  aircraft 
shown  in  Fig.  6.  The  design  window  will  show  the  3 
views  of  the  space,  along  with  a  list  of  the  object  names 
and  types,  as  shown  in  Fig.  7.  This  geometry  is  an 
adaptation  of  a  generic  aircraft  similar  to  that  in  [3]. 
Notice  that  the  list  of  material  names  with  their 
permittivity  and  conductivity  parameters  are  obtained 


Fig.  6:  View  of  geometry. 


Fig.  7:  Edit  views  and  object  list  of  the  design  window. 
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from  Appendix  G  in  [4].  Items  in  this  list  may  be  changed,  except  for  the 
conductor  and  free  space.  "Double-click"  any  item  and  a  material  type 
window  will  appear  as  shown  in  Fig.  8. 

Source  Excitation 

Another  important  part  of  the  FDTD  simulation  is  the  parameters  of  the 
source  excitation.  The  maximum  value  of  the  time  step  (At)  and 
maximum  usable  frequency  are  computed  from  Couranfs  stability 


Fig,  8:  Material  editor. 


condition. 

The  user  may  change  the  parameters  for  a 
Gaussian  pulse  plane  wave  by  selecting  the 
"Gaussian  Pulse"  option  from  the  "Source" 
menu.  This  calls  up  the  Gaussian  parameters 
window  as  shown  in  Fig.  9.  The  "Inside  space" 
option  is  for  setting  a  Gaussian  pulse  type  of 
excitation  in  a  plane  or  at  a  point  as  a  source 
inside  of  the  defined  space.  The  default  "Plane 
Wave"  option  is  currently  supported.  The 
definition  of  the  incident  angles  should  be  the 
same  as  the  viewing  angles  of  the  parallel 
perspective  view  of  the  3D  geometry.  The 
Gaussian  pulse  as  described  by 

=  (1) 
requires  input  parameters  p,  At,  and  the  time 
delay  t.  The  default  value  for  p  is  given  as  64 
and  At  is  computed  by  the  Courant  stability 
condition 
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where  c  is  the  speed  of  light,  and  Ax,  Ay,  and 
Az  are  cell  dimensions  in  the  x,  y,  and  z 
directions,  respectively.  Finally, 

is  required  to  obtain  a  truncation  of  the  pulse  at 
nearly  -140  dB  from  its  maximum  amplitude.  Fig.  9.  Gaussian  pulse  parameters. 

Selection  of  the  "View  Pulse"  button  brings  up  the  "Gaussian  Pulse"  plot  window  of  Fig.  10.  From 
this  view,  the  user  can  get  an  idea  of  the  valid  frequency  range  for  the  simulation.  The  user  may  close 
this  window,  change  the  parameters  of  p  and  At,  and  then  select  the  "View  Pulse"  button  again  to  see 
the  effect  of  the  new  parameters.  The  valid  frequency  range  is  considered  for  a  pulse  'with  frequencies 
that  correspond  to  amplitude  levels  not  less  than  -140dB  in  the  frequency  plot.  The  total  time  shown 
as  the  maximum  value  of  the  x  axis  on  the  time  plot  is  computed  from  the  total  number  of  time  steps 
given  in  the  simulation  parameters  window.  The  amplitude  is  computed  from  equation  (1)  and  the 
frequency  response  is  computed  by  applying  a  fast  Fourier  transform  routine  [5]. 
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Simulation 

The  FDTD  computations  begin  by 
selection  of  the  ’’start"  option  in  the 
"simulation"  menu.  However,  the  user 
may  wish  to  change  the  default 
simulation  parameters,  by  selecting  the 
"parameters"  option  in  the  "simulation" 
menu  to  get  the  simulation  parameters 
window  as  shown  in  Fig.  11.  From  this 
window,  the  user  can  add  sample  points 
and  sample  lines.  The  sampled  points 
and  lines  are  recorded  in  files.  The 
animation  window  parameters  are 
available  in  this  window.  The  animation 
is  a  color  contour-type  plot  of  the 
selected  field  components.  Selection  of 
the  desired  field  component  is  done  by 
selecting  the  appropriate  radio  button. 
The  selected  field  component  is  denoted 

by  a  filled  circle.  When  two  or  more 
test  points  or  test  lines  exist,  a  scroll  bar 
will  appear  for  that  group  to  allow  the 
user  to  scroll  through  all  of  the  test 
points  or  test  line  selections.  After  all 
the  desired  simulation  parameters  are 
selected,  the  user  may  begin  the 
simulation. 

After  selecting  the  "start"  option,  the 
simulation  window  will  show  the 
selected  plane  cut,  with  any  geometry 
cross-section  at  that  plane  dravm  in 
white.  Other  parameters  printed  in  the 
simulation  window  are  the  selected 
scattered  field  component,  the  plane  cut 
with  its  location,  the  current  time  step, 
and  the  corresponding  simulation  time. 
At  the  bottom  of  the  simulation  is  a 
color  scale  showing  the  minimum  and 
maximum  amplitudes  for  the  scale.  Two 
examples  of  the  simulation  screen  are 
shown  in  Figs.  12  and  13  where  all 
objects  are  selected  as  ceramic.  The  simulation  of  a  similar  aircraft  geometry  made  of  perfectly 
conducting  material  is  shown  in  Fig.  14. 


Fig.  10:  Gaussian  pulse  plot  window 
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introduction 

A  problem  common  to  the  development  and  maintenance  of  most 
computational  electromagnetics  (CEM)  codes  is  validation.  This 
problem  becomes  particularly  acute  for  large  codes  involving  numerous 
modules  and  program  paths.  The  best  kind  of  CEM  code  validation  is 
good  agreement  between  calculated  results  and  measured  data  of  known 
accuracy.  However,  it  is  usually  difficult  or  impossible  to  relate 
differences  observed  between  the  CEM  code  output  and  measured  data 
to  specific  limitations  within  the  code  quantitatively  or  even 
qualitatively.  The  same  problem  occurs  when  comparing  calculated 
results  for  a  standard  problem  using  several  CEM  codes. 

During  the  past  year,  Rome  Laboratory  has  constructed  a  novel  test 
article  called  the  Transformable  Scale  Aircraft-Like  Model  (TSAM) 
that  is  meant  to  be  used  as  a  CEM  code  validation  tool.  Because  TSAM 
is  transformable,  its  configuration  can  be  modified  at  will  allowing 
measurements  to  be  made  on  a  variety  of  profiles  in  contrast  to  a 
static  test  article.  Initially,  TSAM  is  designed  to  accurately  mirror 
the  geometry  modeling  capabilities  of  a  number  of  CEM  codes  and,  as 
a  result,  it  holds  great  promise  for  allowing  algorithm  and  modeling 
errors  embedded  within  a  CEM  code  to  be  estimated  qualitatively  and 
even  quantitatively  using  measurements  on  the  TSAM  as  a  guide.  Errors 
then  can  be  adjusted  as  necessary  to  achieve  a  given  level  of  code 
simulation  accuracy.  TSAM  was  designed  primarily  to  study  platform 
effects  on  the  radiation  pattern  of  a  co-located  antenna  (radiation 
problem)  but  also  can  be  used  to  study  platform  scattering  problems 
if  desired. 

In  this  paper,  a  quick  overview  of  the  TSAM  concept  is  presented 
followed  by  a  description  of  the  TSAM  configuration  (including  the 
antennas  exciting  the  structure) .  A  representative  set  of  measured 
antenna  patterns  and  antenna  isolation  results  is  given  along  with 
an  error  analysis  and  accuracy  estimates.  A  description  of  the  Rome 
Laboratory  anechoic  chamber  measurement  facilities  also  is  included. 

The  TSAM  Concept 

When  the  computed  results  from  a  CEM  code  are  compared  to 
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measurement,  differences  are  always  present.  These  differences  are 
due  to  the  following  factors:  1)  limitations  within  the  code  on 
modeling  the  geometry  and  physics  of  the  problem  (modeling  error) ; 
2)  limitations  in  the  equations  used  by  the  code  to  describe  the 
electromagnetic  phenomena  and  in  their  specific  implementation 
(algorithm  error) ;  3)  limitations  in  computational  capabilities  of 
the  computer  platform  hosting  the  code  (computation  error) ,  and,  4) 
limitations  in  measurement  accuracy  (measurement  error) .  The 
combination  of  these  errors  represents  the  difference  observed 
between  computation  and  measurements  (simulation  error) .  Individual 
errors  usually  are  not  known  and  may  be  mutually  dependent.  If 
computation  error  is  insignif igent  and  measurement  error  is  known, 
simulation  error  then  represents  the  modeling  and  algorithm  errors. 

The  novelty  of  TSAM  is  making  measurements  on  a  scale  aircraft¬ 
like  model  that  is  a  duplicate,  in  both  form  and  substance,  of  the 
model  in  the  code.  Differences  between  measured  and  computed  results 
then  is  attributable  to  algorithm  error  since  measured  and  computed 
models  coincide.  These  differences  now  can  point  to  specific  causes 
within  the  CEM  code.  TSAM  is  composed  of  replaceable  components 
(including  antennas)  chosen  to  coincide  with  the  current  modeling 
capabilities  of  common  CEM  codes.  The  goal  is  for  the  TSAM  components 
and  configuration  to  evolve  and  mature  over  time  into  a  realistic 
airframe  with  antennas  with  detail  added  incrementally  to  allow  an 
assessment  of  the  impact  of  each  component  to  be  made. 

TSM  Initial  Configuration!  Structure  and  Mounted  Antennas 

The  TSAM  design  described  here  was  chosen  to  conform  roughly  to  a 
1/20  scale  model  of  a  wide-bodied  aircraft.  TSAM  is  made  of  aluminum 
and  was  fabricated  into  several  canonic  shapes  to  represent  wings, 
fuselage,  stabilizers,  engine  nacelles  and  pylons.  Each  canonic  shape 
can  be  modeled  very  accurately. 

The  current  TSAM  configuration  is  shown  in  Figure  1.  The  fuselage 
is  a  hollow  elliptical  cylinder  approximately  93  inches  long  with 
diameters  8.5  and  7.5  inches  respectively.  The  cylinder  is  capped 
at  both  ends  forming  a  blunt  nose  and  tapered  end.  Two  thin  flat 
plates  serve  as  wings  with  rectangular  aluminum  supporting  plates 
underneath  to  hold  the  wings  rigid.  The  wingspread  is  76  inches. 
Three  thin  plates  form  the  horizonal  and  vertical  stabilizers.  Four 
capped  circular  cylinder  structures  represent  the  engine  nacelles  and 
are  attached  to  the  wing  structures  by  four  thin  plates  which  serve 
as  pylons.  All  components  are  hinged  and  individually  removable.  Each 
component  geometry  can  be  modeled  accurately  and  the  metallic 
structure  is  assumed  perfectly  conducting.  The  thin  plates  serving 
as  wings  are  hinged  in  order  to  vary  the  wing  cant.  Fuselage  rigidity 
is  maintained  by  an  I-beam  positioned  inside  the  cylinder. 
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Figure  1.  Initial  TSAM  configuration 


Six  thin  wire  monopole  antennas,  three  located  directly  on  top  of 
the  cylinder  and  three  on  the  bottom,  are  shown  schematically  in 
Figure  1.  Because  each  antenna  is  only  15  mm  long  they  are  shown  not 
to  scale.  Each  antenna  is  labeled  numerically  and  positioned  a 
certain  distance  away  from  the  blunt  nose  as  shown  in  Figure  1.  The 
I-beam  serves  to  position  and  support  the  mounted  antennas  and  their 
feed  structure, 

TSAM  Antenna  Pattern  Measurement  Facilities  and  Procedures.: 

All  antenna  radiation  pattern  measurements  were  performed  in  the 
large  anechoic  chamber  located  in  the  Rome  Laboratory  Electromagnetic 
Environmental  Effects  Research  Center.  The  chamber  is  designed  to 
provide  a  20  cubic  foot  quiet  zone  in  the  center  of  the  chamber.  A 
functional  block  diagram  of  the  facility  set  up  to  determine  antenna 
pattern  measurements  is  shown  in  Figure  2. 

Gain  calibrated  TSAM  antenna  radiation  pattern  data  was  obtained 
by  rotating  the  TSAM  test  article  in  an  incident  field  radiated  from 
a  standard  gain  horn,  at  a  fixed  frequency,  while  recording  the 
amplitude  of  power  received  on  each  test  antenna  as  a  function  of 
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Figure  2.  Antenna  Pattern  Measurement  System  Functional  Diagram 


positioner  angle.  Pattern  cuts  corresponding  to  the  roll,  pitch  and 
yaw  (azimuth)  principal  planes  were  measured.  Gain  calibration  was 
obtained  through  a  standard  gain  antenna  substitution  method  combined 
with  RF  cable  insertion  loss  calibration.  The  S/A  58000  Series  model 
tower  with  rotary  joint  provided  proper  rotation  and  orientation  of 
TSAM  during  antenna  pattern  data  collection.  A  low  dielectric 
constant  material  was  used  for  the  tower  and  rotating  head  drive 
shaft  to  reduce  reflections  while  the  base  of  the  model  tower  was 
covered  with  RF  absorber  material. 

The  RF  incident  field  was  provided  by  a  synthesized  signal  source 
at  the  frequencies  4.5,  6.0  and  8.0  GHz.  This  source  was  connected, 
as  shown  in  Figure  2,  to  an  American  Electronic  Laboratory  (AEL) 
H5200  standard  gain  horn  antenna  mounted  on  a  telescoping  transmit 
tower  near  the  chamber  balcony.  The  TSAM  antennas  were  connected  to 
the  receiver  and  recording  systems,  as  shown  in  Figure  2,  and  the 
received  amplitude  signals  were  recorded  as  a  function  of  angular 
position  for  each  frequency. 

TSAM  Antenna  Isolation  Measurement  Facilitie3_and  Proceduregl 

The  same  anechoic  chamber  facility  was  used  to  perform  the  TSM 
antenna  isolation  measurements  but  with  a  different  test  setup  as  is 
shown  schematically  in  Figure  3. 
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Figure  3.  TSAM  Antenna  Isolation  Measurement  Block  Diagram 


TSAM  was  first  positioned  on  RF  transparent  foam  blocks  and  set  in 
the  chamber.  Isolation  measurements  then  were  performed  between 
all  TSAM  antenna  pairs.  For  a  given  pair,  one  antenna  was  chosen 
to  transmit  an  RF  signal  and  the  other  one  to  receive  using  the  RF 
switch  shown  in  Figure  3.  The  transmit  and  receive  signals  were 
connected  respectively  to  the  SI  and  S2  ports  of  an  HP  8510 
Network  Analyzer  and  S12  measurements  were  performed  for  the 
transmit  antenna  and  each  receive  antenna  and  reference  line  over 
the  frequency  range  4-8  GHz  using  the  RF  switch.  A  different  antenna 
then  was  chosen  for  transmit  and  the  process  was  repeated.  The  raw 
antenna  isolation  data  then  was  calibrated  using  the  reference  line 
data  and  the  results  recorded  as  a  function  of  frequency. 


The  large  anechoic  chamber  in  the  Rome  Laboratory  Electromagnetic 
Environmental  Effects  Research  Center  has  been  utilized  for  many 
years  for  a  variety  of  antenna  measurements  and  the  sources  of  error 
and  their  management  are  well  known.  Careful  calibration  procedures 
are  performed  before  each  set  of  measurements  are  recorded  and  at 
selected  intervals  during  the  measurements.  A  summary  of  sources  of 
error,  their  magnitudes,  and  an  estimated  measurement  accuracy  are 
given  below  for  both  the  TSAM  antenna  pattern  measurements  and 
antenna  isolation. 

Antenna  pattern  gain  measurements  are  subject  to  errors  in  the 
standard  gain  antenna  radiating  source,  along  the  signal  propagation 
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path,  in  the  switching  network,  the  receiver,  and  the  test  article 
positioner.  The  signal  source  drifts  in  both  power  and  frequency. 
This  paraineter  drift  contributes  an  estimated  ±0.3  dB  error  to  the 
TSAM  pattern  measurement  as  well  as  an  additional  ±0.2  dB  error  due 
to  polarization  impurity.  The  signal  propagation  path  is  subject  to 
the  effects  of  multipath  that  exist  in  the  chamber.  Chamber 
reflectivity  measurements  add  an  estimated  ±0.1  dB  error  to  pattern 
measurements.  The  RF  switches  used  for  switching  between  the  TSAM 
antennas  and  the  RF  connecting  cables  will  affect  signal  power  levels 
due  to  limitations  in  calibration  and  temperature  drift  and  insertion 
loss.  Each  factor  contributes  approximately  ±0.5  dB  to  the 
measurement  error.  Finally,  receiver  drift  adds  about  ±0.3  dB  to  the 
error.  Positioner  error  is  approximately  ±0.1°  and  is  not  considered 
a  signifigent  contributor  to  antenna  gain  pattern  measurements.  The 
resulting  antenna  pattern  measurement  accuracy  is  estimated  to  be 
±0.8  dB. 


Antenna  isolation  measurements  are  subject  to  a  similar  set  of 
errors.  Care  was  taken  to  eliminate  any  systematic  errors.  The 
random  errors  considered  in  the  isolation  measurement  and  their 
associated  estimates  of  error  are:  reference  line  calibration  (±0.5 
dB  VSWR,  ±0.7  dB  attenuation)  ;  losses  in  the  RF  switching  network  and 
connecting  cables  (±0.2  dB  switch  actuation,  ±0.5  dB  calibration); 
microwave  component  temperature  drift  (±0.3  dB) ,  and  receiver  drift 
(±0.1  dB  receiver  error,  ±0.5  dB  dynamic  settling  error).  Signal 
multipath  effects  were  not  considered  signifigent  since  the  ”on  TSAM” 
coupling  paths  were  dominant.  The  measurement  accuracy  for  antenna 
isolation  measurements  was  calculated  to  be  ±1.18  dB. 

Initial  TSAM  Pattern  and  Isolation  Meagured.  Resultei 

A  set  of  principal  plane  pattern  cuts  are  shown  in  Figure  4  at 
6  GHz  for  antenna  3  (see  Figure  1)  transmitting  for  the  full  TSAM 
configuration  and  for  the  cylindrical  fuselage  only.  This  choice  of 
measurements  illustrates  the  ease  with  which  large  configuration 
changes  can  be  made.  The  smooth  patterns  for  the  cylinder  contrast 
with  the  scalloping  patterns  of  the  full  structure,  especially  for 
the  roll  and  yah  cuts. 

A  set  of  isolation  measurements  are  shown  in  Figures  5a~c.  In  each 
case  antenna  2  is  the  transmitting  antenna  and  the  isolation  is 
measured  for  antennas  1,  3,  and  4  over  the  frequency  range  4-8  GHz. 
for  the  cases  of  engines  off  and  on  TSAM.  Antenna  2  is  the  bottom 
front  antenna  while  antennas  1-3  are  on  top  of  the  cylinder.  Even 
though  antennas  1  and  2  are  located  away  from  the  engines, ^  scattering 
effects  show  up  as  scalloping  on  the  results.  The  same  is  true  for 
antennas  3  and  2.  Antennas  4  and  2  are  located  on  the  bottom  of  the 
cylinder  and  show  less  isolation. 

AoKnpwIgdggment : 

To  Rome  Research  Corporation,  Rome  NY,  for  all  TSAM  measurements. 
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Figure  4 

ISAM  Patterns  -  Antenna  3  -  6  GHz. 


ISOLATION  (dB) 


Figure  5a.  Antenna  2  -  Antenna  1  Isolation 


Figure  5b.  Antenna  2  -  Antenna  3  Isolation 


Figure  5c.  Antenna  2  -  Antenna  4  Isolation 
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Measurement  Study  for  Validation  of  Electromagnetics  Scattering  Codes 
on  a  Complex  3D  Target 

T.L.  Kienberger  and  D.E.  Jurgens 
Boeing  Defense  and  Space  Group 


L  INTRODUCTION 


Low  radar  cross  section  (RCS)  targets  are  becoming  prevalent  and  are  presenting  several  new 
challenges  for  electromagnetic  scattering  codes.  The  physical  target  shapes  and  material  characteristics  are 
continually  being  refined  and  optimized.  Traditionally  analysis  codes  are  validated  by  comparing  the 
predictions  to  a  "trusted"  code  or  against  an  exact  solution.  Because  RCS  levels  are  decreasing  and  the 
complexity  of  the  targets  are  evolving,  the  importance  of  establishing  industry  standard  low-level  validation 
cases  with  measured  data  is  increasing. 

This  paper  describes  one  of  the  methods  used  by  the  Boeing  Defense  and  Space  Group  to  validate  the 
accuracy  of  3D  predictions  of  low  level  targets.  As  previously  mentioned,  one  of  Ae  most  common  ways  of 
validating  analytical  results  is  to  compare  the  output  to  a  previously  validated  code.  However,  on  more 
complicated  targets,  when  analysis  produces  varied  results  it  is  difficult  to  know  with  certainty  which  of  the 
codes  is  correct.  This  study  originated  fi:om  a  just  such  a  debate. 

A  test  body  was  designed  with  features  representative  of  today's  most  challenging  targets.  The  target 
was  manufactured  and  then  measured  at  the  Boeing  Compact  Radar  Range.  A  comprehensive  set  of  angles 
and  fi'equencies  where  used  to  gather  data.  The  primary  purpose  of  this  research  was  to  collect  data  for  code 
validation. 

The  measured  data  was  compared  to  predictions  from  two  industry  standard  method  of  moments 
(MoM)  codes.  The  results  are  presented  in  Section  V  along  with  comments  and  conclusions. 


IL  MEASUREMENT  PLATFORM 

The  measurement  platform,  here  after  referred  to  as  the  target,  used  for  this  study  was  chosen  not  to 
demonstrate  the  capabilities  but  to  explore  the  limitations  of  a  given  code.  The  target  was  to  have  challenging 
features  based  on  an  user's  concerns  of  design  problems  of  interest  to  Boeing.  Thus  instead  of  a  series  of 
separate  targets  to  address  features  such  as  narrow  wedge  angles,  multiple  junctions  and  material  interfaces, 
one  target  is  desired  where  these  features  can  be  identified  but  not  necessarily  isolated  from  one  another.  The 
final  consideration  in  the  target  design  was  size.  A  target  and  corresponding  measured  data  were  desired 
which  would  be  typical  of  tiiose  run  on  a  massively  parallel  computer,  that  is  large  for  a  numerical 
computation.  At  the  same  time  it  was  desirable  that  the  physical  dimensions  allow  for  ease  of  handling 
during  data  acquisition  and  that  cost  of  model  alterations  be  minimal.  These  considerations  along  with  the 
available  measurement  facility  and  its  capability  resulted  in  the  following  model. 

The  target  is  an  asymmetric  triangle  in  the  planform  or  top  view,  see  figure  la.  The  length  of  the 
three  sides  are  176.7,  140.0,  and  77.7  inches.  It  is  considered  thin  in  that  the  volumetric  height/depth  is 
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small  compared  to  die  length  of  each  planform  side,  see  figure  lb.  This  model  has  1  volumetric  edge  and 
two  "knife"  edges.  A  kmfe  edge  is  defined  as  a  contour  tapering  \\duch  approaches  a  very  thin  sheet.  The 
volumetric  edge  has  a  height  of  approximately  2  inches.  The  targets  body  is  coated  with  a  conductive  paint, 
dius  it  can  be  modeled  as  a  perfect  electncal  conductor,  or  PEC.  Further  more  the  target  is  a  closed  volume 
without  the  addition  of  any  edges.  The  target  is  also  symmetric  in  the  volumetric  direction. 

Edges  were  attached  to  all  3  sides.  The  primary  reason  was  to  verify  multiple  surface  junction 
calculations  of  differing  materials.  Two  of  the  targets  edges,  in  particular  sides  B  and  C  as  depicted  in  figure 
1,  are  very  thin  sheets.  The  bcxfy  of  the  target  narrows  to  knife  edges  on  these  sides.  The  longest  side  of  the 
target  terminates  with  a  wall  height.  Thus  the  edge  added  to  this  side  must  have  some  volumetric  thickness 


(a) 

Figure  1,  Triangular  Measurement  Fixture,  views  produced  with  IDEAS^ 

in  order  to  maintain  the  smooth  contour  of  the  target.  The  edge  on  this  side  has  the  shqie  of  a  wedge  where 
the  wedge  angle  is  defined  such  that  the  shapes  contour  is  continuos  to  the  first  order  derivative.  The  edge 
shape  also  allows  for  a  bulk  material  to  be  placed  inside  this  edge.  For  this  study,  however,  only  tower  foam^ 
was  used  for  structural  support. 

This  paper  presents  results  fi*om  two  configurations.  They  will  be  referred  to  as  the  resistive  case  (1) 
and  the  conducting  case  (2).  The  conducting  case  is  simply  tiie  same  as  the  resistive  case  except  that 
aluminum  foil  was  used  to  cover  all  edge  materials.  At  the  firequencies  used  in  this  study  the  minimal  surface 
irregularities  are  very  small  compared  to  the  wavelength.  Thereby  a  PEC  for  the  baseline  case  in  this  study  is 
created  \^hich  effectively  simulates  a  smooth  conducting  surface.  Thus,  only  the  treated  case  is  described  in 
detail  here. 


*  Tower  foam  is  similar  to  ordinary  styrofoam.  It  is  lossless  which  means  it  does  not  dissipate  radiated  energy  at  any  significant 
level.  It  can  be  modeled  as  a  having  the  properties  of  free  space  or,  as  is  customary,  it  can  be  completely  ignored  in  the  numerical 
calculation. 
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The  material  used  for  the  treated  case  was  very  thin  sheets  of  macrofol^.  Each  edge  was  constructed 
in  the  same  manner.  The  knife  edges,  sides  B  and  C,  are  the  same  as  the  volumetric  edge,  side  A,  except 
collapsed  down  to  two  thin  sheets  adhered  together.  The  edges  were  constructed  in  a  symmetrical  sense  thus 
only  one  side  of  any  given  edge  need  be  described.  The  edge  is  approximately  4.5"  wide.  It  is  constructed  of 
four  strips  of  macrofol  with  differing  material  characteristics.  This  was  done  to  evaluate  the  ability  to  model 
edge  to  edge  interfaces  of  differing  material  properties.  Lossless  tower  foam  type  material  was  used  for 
internal  support  on  the  volumetric  edge  and  external  support  on  the  very  thin  edges.  For  target  dimensions 
with  the  edges  installed  see  figure  2. 


Figure  2,  Triangular  Measurement  Fixture  with  edges, 

IIL  TARGET  RANGE  MEASUREMENT 


Range  measurements  do  not  produce  absolute  values,  that  is  exact  backscattering  levels  for  a  given 
target.  Error  mitigation  and  data  processing  is  usually  required  even  in  the  best  of  circumstances. 
Measurements  performed  in  a  controlled  environment  will  produce  a  good  approximation  of  the 
backscattering  level  which  is  repeatable  within  a  certain  tolerance.  For  this  reason  the  range  facility  and  the 
data  acquisition  method  deserve  some  definition. 

All  range  backscattering  data  presented  in  this  paper  was  taken  at  Boeing  Defense  and  Space 
Group’s  9-77  Building  Indoor  Compact  Radar  Range[l]  over  a  time  period  of  4  days.  This  compact  radar 
range  uses  2  reflectors  to  create  a  far  field  condition.  That  is  the  wave  fronts  are  propagating  nearly  parallel 
after  the  second  reflection.  This  is  easily  visualized  by  comparing  to  the  use  of  lenses  in  optics  to  collimate  a 
light  beam.  The  physical  space  where  models  are  positioned  is  referred  to  as  the  "quiet  zone."  This  provides 
a  minimum  noise  area  within  \^dlich  a  target  may  be  placed,  see  figure  3. 


^  Macrofol  is  an  impedance  sheet  material  which  can  be  model  as  having  a  constant  ohms  per  square  value. 
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The  model  was  hung  from  four  string  reels  connected  to  the  model  at  4  locations.  These  strings  are 
used  to  control  model  yaw  and  pitch.  Unfortunately,  the  model  attachment  points  were  not  optimally  placed 
for  it's  given  weight.  Thus,  as  the  model  was  rotated  to  obtain  angular  data  it  flew  a  slight  orbit  around  the 
theoretical  axis  of  rotation.  The  target  remained  within  the  quiet  zone  and  thus  within  the  simulated  plane 
wave  field  at  all  times,  thus  only  a  phase  (and  not  a  magnitude)  displacement^  occurs  in  the  data. 


Figure  3,  Boeing  Defense  &  Space  Group’s  indoor  compact  radar  range . 


The  range  is  characterized  by  taking  various  types  of  data.  Some  of  the  more  significant  are  listed 
here.  The  range  is  calibrated  using  a  known  target,  such  as  a  sphere,  positioned  in  the  quiet  zone  as 
downrange  strobes  are  collected.  A  downrange  collects  scattering  data  from  the  full  length  of  the  chamber  by 
varying  the  return  signal  time  at  the  receive  gate.  The  return  of  the  sphere  should  occur  within  the  quiet  zone 
at  the  known  level.  Adjustments  to  subsequent  radar  data  are  made  based  on  this  calibration.  Empty 
chamber  data  is  collected  as  well.  This  defines  the  noise  level  of  the  range  at  any  given  frequency. 


^  This  range  facility  utilizes  a  highly  precise  laser  system  (POMS  [1])  to  report  exact  target  position  within  the 
global  measurement  coordinate  system.  Thus,  adjustment  for  phase  displacement  in  the  data  can  be  easily 
applied 


279 


Data  was  taken  in  a  small/narrow  band.  Of  interest  was  any  set  of  frequencies  with  corresponding 
wavelengths  much  smaller  dian  the  targets  overall  dimensions.  Two  transmit-receive  horns  were  used  in  this 
band.  Presented  in  this  paper  are  corresponding  calculations  within  the  first  horn’s  range.  The  treated  target 
was  near  the  noise  floor  at  this  band,  however  sufficient  data  was  taken  to  restore  most  of  the  data  using 
standard  signal  processing  methods.  For  example,  the  range  noise  is  measured  without  a  target.  Then  this 
data  is  subtracted  using  the  complex  field  from  the  data  with  the  target. 

The  target  data  was  collected  for  various  aspect  cuts.  The  aspect  angle  is  the  target's  orientation  with 
respect  to  the  incoming  field  direction  or  radar  look  angle.  Data  for  two  aspect  cuts  was  collected.  First,  was 
a  0°  aspect  cut.  At  0^  aspect  the  target’s  is  horizontal  with  respect  to  the  look  angle.  Conversely  a  90°  aspect 
cut  would  position  the  target  normal  to  the  look  angle. 

The  second  cut  was  a  10°  great  circle.  A  great  circle  means  a  changing  aspect  angle.  That  is  the 
target  is  tilted  10°  (aspect  10°)  and  then  rotated  without  correction  to  the  aspect  angle. 

Third,  a  conic  of  5°  was  attempted.  A  conic  maintains  the  aspect  angle  of  5°  as  the  target  rotates. 
Unfortunately,  due  to  die  target’s  attachment  points  diis  cut  could  not  be  accomplished.  The  length  of  the 
strings  which  hold  the  target  must  constantly  be  adjusted  and  if  the  attachment  points  with  respect  to  the 
targets  weight  and  center  of  gravity  are  not  placed  optimally  it  becomes  extremely  difficult  to  maintain  the 
constant  aspect  angle  as  the  target  rotates. 

For  the  first  two  aspect  cuts  data  was  collected  for  both  cases,  PEC  and  resistive.  Range  calibrations 
taken  before  and  after  the  above  data  collection  process  was  within  an  acceptable  tolerance.  Thus,  all  data 
taken  is  valid  within  the  same  relative  error  bar. 


IV.  CALCULATED  SCATTERING 


Scattering  results  presented  in  this  paper  were  calculated  with  FERM  [2],  a  Method-of-Moments  (or 
MoM)  code  produced  by  NOT,  and  CarJos-^iD  [3],  a  MoM  code  produced  by  McDonnell  Douglas.  Method- 
of-Moments  is  primarily  a  solution  method.  Both  of  these  codes  use  MoM  to  solve  for  the  scattering  from  a 
body  in  a  similar  way.  The  body  is  discretized  using  facets,  in  particular  infinitely  thin  shell  triangular  facets. 
One  then  solves  for  the  currents  on  these  facets  induced  by  an  external  field.  Thus,  the  number  of  facets 
which  define  the  body  has  an  obvious  affect  on  the  solution.  Also,  the  induced  current  is  affected  by 
neighboring  facets.  Therefor,  the  proximity  of  surrounding  facets  must  also  be  considered.  In  general,  the 
facet  definition  of  a  body  can  have  a  significant  impact  on  the  solution.  For  this  reason  the  analysis  model  and 
die  generation  there  of  is  described  in  detail. 

The  faceted  calculation  model  was  generated  from  a  surface  representation  (IGES  128)  of  the  target. 
All  surfaces  on  the  target  were  generated  from  scratch  using  IDEAS™[4]  and  TOPGUN  [5].  Theses 
surfaces  are  only  accurate  to  within  +/-0.1  inches,  since  the  target  was  not  originally  built  with  a  CAD 
system.  In  general,  this  error  would  be  of  concern  since,  as  stated  above,  the  number  of  facets  which  define 
an  object  affect  the  solutioa  However,  the  number  of  facets  reaches  a  saturation  level  which  is  a  frequency 
dependent.  A  more  exact  requirement  on  the  facets  is  specified  by  requiring  the  length  of  the  facet’s  edges  to 
be  defined  in  terms  of  the  wavelength  (X)  at  the  calculation  frequency.  Furthermore,  resolution  beyond  a  X 
per  length,  e.g.  X  /20,  will  not  significantly  increase  the  calculation  solution  and  this  quantity  could  be  defined 
as  maximum  resolution.  Thus  the  above  margin  of  error  in  the  surface  definition  is  acceptable  for  calculation 
frequencies  below  the  maximum  resolution  quantity. 
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The  facetization  of  the  surfaces,  see  figure  4,  was  accomplished  using  GRIDDER  [6]  and  IDEAS™ 
[4].  The  facet  resolution  of  the  target  is  on  the  average  of  X  /lO.  Figure  5  provides  a  general  gauge  for  the 
model  resolution.  Plotted  are  the  edges  sorted  by  length  with  various  resolution  levels  depicted.  There  are 
other  factors  involved  in  creating  a  faceted  model  which  must  also  be  addressed.  These  factors  are  identified 
in  rule-of-thumb  meshing  criteria  which  generally  produce  good  results  in  the  calculation.  These  criteria  are 
termed  rule-of-thumb  since  they  are  not  identified  in  the  user  manual  provided  with  the  analysis  code.  These 
heuristics  are  a  result  of  trial  and  error  as  well  as  experience  with  similar  numerically  complex  calculation 
codes.  Two  of  these  factors  taken  into  consideration  are  1)  the  size  and  spacing  of  die  facets  and  2)  the 
distance  between  facet  centers  lying  in  parallel  planes. 


Figure  4,  Faceted  model,  views  produced  with  IDEAS^. 


In  general,  equilateral  triangles  spaced  evenly  or  spaced  such  that  neighboring  facet  variation  is 
minimized  produces  good  results  in  the  calculated  results.  This  model  maintains  a  maximum  variation  of 
20%  or  less.  Additionally,  the  distance  between  two  facets  forming  a  wedge  or  two  facets  lying  on  parallel 
planes  veiy  close  together  can  cause  singularity  problems  in  the  calculation.  This  problem  was  avoided  in 
this  model  by  collapsing  facets  vdiose  centers  were  within  a  3®  wedge  angle.  Due  to  the  symmetry  of  the 
target  some  of  the  comer  facets  of  the  model  were  also  collapsed. 

It  is  important  to  accurately  define  die  physical  model,  or  target,  however,  the  accuracy  of  die 
calculation  model  is  defined  by  limiting  factors  in  a  numerically  complex  calculation  code.  The  more 
significant  factors  in  diis  study  are  the  per  wavelength  discretization  and  singularity  criteria  which  the 
calculation  code  can  handle.  Thus,  for  the  results  presented  in  this  study  the  calculation  model  is  an  accurate 
representation  of  the  physical  model. 
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Edge  Statistics  for  FULL_SHARP.unv 


Edges  Sorted  by  Length 


Figure  5,  Faceted  model  edges  sorted  by  length  as  a  resolution  gauge. 


V.  RESULTS/CONCLUSIONS 


This  study  was  not  conducted  to  advocate  the  use  of  one  complex  numerical  calculation  code  over  the 
use  of  another.  Several  3D-MoM  codes  have  been  created  and  released  within  die  last  few  years.  This  study 
was  initiated  to  provide  a  complex  platform  from  which  a  verification  database  could  be  built.  At  the  release 
of  this  paper  only  numerical  calculations  were  accomplished  using  FERM  [2]  and  Carlos-3tD  [3],  The  model 
contains  25,000  unloiowns  which  even  for  a  MPP  is  a  large  problem.  Comparisons  of  the  conducting  and  the 
treated  case  are  shown  only  at  one  frequency. 

Comparisons  of  die  conducting  case  are  shown  only  for  FERM  and  the  measured  data,  see  figure  6. 
The  problem  of  a  PEC  has  been  thoroughly  studied  and  it  is  assumed  at  a  minimum  that  a  3D-MoM  code  can 
accurately  calculate  scattering  for  this  case.  None  the  less,  as  a  baseline  case  the  conducting  model  was 
calculated  but  this  configuration  receives  a  lower  priority. 

Of  greater  concern  in  this  study  is  the  treated  case.  Figure  7  shows  a  comparison  of  calculated  data 
from  FERM  and  Carlos-3I>  to  the  measured  data  The  data  presented  here  is  not  shown  with  error  bars. 
Exact  margins  of  error  have  not  been  calculated.  The  data  presented  in  this  study  has  been  processed  to 
provide  a  clear  comparison  over  several  angles.  The  data  is  processed  by  calculating  the  median  for  a  5*^ 
window  around  every  degree  between  0*  and  180^  at  1^  increments.  This  removes  the  hashing  which  is 
typical  in  a  scattering  measurement,  see  figure  8. 
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Verification  Scattering  Comparison  of  0  degree  Aspect  Cut 
Frequertcy:  880MHz:  Polarization:  Horizontal; 
Measured  (black-solid)  /  PERM  (Blue-solid) 


Figure  6,  Conducting  case  results  for  PERM  calculated  and  measured  data. 


Verification  Scattering  Cofnparison  of  10  degree  Great  Circle  Cut 
Frequency:  880MHz;  Polarization:  Horizontal: 

Measured  (black-solid)  /  PERM  (Blue-solid)  /  Carlos-3D  (Red-dotted) 


Figure  7,  Conducting  case  results  for  PERM,  and  Carlos-3D  calculated  and  measured  data. 
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Boeing  Volumetric  Triangular  Measurement  Fixture 
Configuration:  ctb2_lmb1_gc10  Freq:  0.880  Median  5/1 


Anglos 


Figure  8,  Effect  of  post  processing  data. 


VI.  FURTHER  STUDY 


As  stated  above,  the  primary  purpose  of  this  stucfy  was  to  collect  data  for  validation  of  complex 
numerical  codes.  The  goal  of  this  study  was  to  create  a  measurement  fixture  wdiich  could  serve  to  verify 
several  capabilities  of  a  code.  This  stucfy  investigated  PEC  and  infinitely  thin  sheet  impedance  surfaces.  Two 
directions  of  further  study  have  been  identified.  One  evolves  changes  to  the  physical  model.  The  other 
concentrates  on  refinement  of  the  faceted  model  and  calculations  with  other  complex  numerical  analysis 
codes. 

Further  study  needs  to  collect  measurement  data  fi-om  this  basic  target  with  each  model  alteration. 
Dielectric  and  magnetic  materials  should  be  applied  to  the  model  individually  and  together.  Thin  materials, 
which  cannot  be  approximated  as  infinitely  thin  sheets,  and  layered  materials  need  to  be  addressed  as  well. 

In  the  other  direction,  refinement  of  the  faceted  model  to  provide  better  resolution  is  planned.  Studies 
on  the  other  polarization,  this  paper  presented  on  one  polarization,  and  with  other  3D-MoM  codes,  such  as 
PARAMOM  [7]  will  also  be  addressed. 

In  closing  the  authors  would  like  to  stress  the  necessity  of  a  verification  platform  of  this  nature.  Good 
measurement  data  of  complex  objects  is  becoming  more  significant.  The  ability  of  numerically  complex 
scattering  codes  is  steadily  increasing.  Likewise,  the  availability  of  MPC  (Massively  Parallel  Computers)  and 
the  typical  matrix  size  winch  can  be  solved  are  making  jumps  in  development.  A  complex  verification 
database  should  be  required  as  a  baseline  check  to  maintain  the  pace  in  development  of  computers,  in 
particular  MPCs,  and  complex  numerical  codes,  specifically  electromagnetic  scattering  codes. 
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1  Introduction 

In  validating  radar  cross  section  or  surface  current  solutions  obtained  by  numerical  methods  an  important  issue  is 
to  obtain  more  accurate  reference  data.  As  analytical  solutions  are  available  only  for  a  limited  number  of  objects, 
other  approaches  are  needed  for  more  general  objects.  The  approach  presented  here  is  also  based  on  numerical 
calculation,  but  this  method  has  the  advantage  of  using  an  exact  representation  of  the  object.  A  drawback  of  this 
approach  is  that  only  simple  geometries  can  be  represented  in  this  way.  Nevertheless  the  method  is  useful  for  val¬ 
idating  purposes  as  the  object  used  for  validation  still  can  be  chosen  quite  freely. 

The  validation  would  be  performed  as  follows:  a  suitable  object,  possible  to  represent  as  mentioned,  is  chosen 
and  the  scattering  problem  is  solved  using  both  the  presented  method  and  the  method  to  be  validated.  After  the 
two  calculations  the  results  are  compared,  and  the  accuracy  of  the  method  to  be  validated  is  estimated.  The  vali¬ 
dated  method  then  can  be  used  to  model  more  complicated  objects.  Obviously  the  object  selected  for  validating 
purposes  should  be  chosen  according  to  the  setting  in  which  the  validated  method  will  be  used. 

The  presented  method  solves  the  Magnetic-Field  Integral  Equation  (MFIE)  and  the  Augmented  Magnetic-Field 
Integral  Equation  (AMFIE)  for  the  specified  geometry.  The  method  consists  of  five  steps: 

1.  Divide  the  surface  into  patches. 

2.  Expand  the  surface  current  in  appropriate  basis  functions. 

3.  Multiply  the  equation  with  test  functions. 

4.  Calculate  the  integrals  in  question. 

5.  Solve  the  system  of  linear  equations  that  emerges. 

The  approach  has  been  implemented  in  MATLAB  and  FORTRAN.  MATLAB  is  used  for  user  interface,  plotting, 
object  generation  and  so  on,  while  the  necessary  integrations  are  carried  out  by  the  FORTRAN  code.  The  code  is 
meant  to  run  on  a  workstation,  and  not  a  mainframe,  which  makes  the  calculation  cheap  and  simple  to  perform. 
The  various  graphs  in  this  paper  have  been  generated  with  only  workstation  hardware.  The  objects  available  to 
the  user  are  such  as  spheres,  cylinders,  cones,  ogives,  boxes,  and  so  on.  It  is  also  possible  to  add  new  types  of 
objects  (and  even  new  patch  types).  Some  results  obtained  from  the  code  will  also  be  presented,  and  it  will  be 
shown  that  the  accuracy  increases  significantly  by  using  exact  representation  of  the  object. 

2  Patch  representation  and  basis  functions 

The  surface  must  be  divided  into  patches  that  can  be  mapped  onto  some  surface  where  the  numerical  integration 
is  easy  to  perform  (the  patches  must  of  course  be  numbered,  but  the  patch  index  is  suppressed  here  for  simplic¬ 
ity).  One  possible  and  attractive  choice  is  to  have  patches  with  four  comers,  which  are  easily  mapped  to  a  square. 
This  is  nice  because  there  are  Gaussian  2-dimensional  numerical  integration  (qubature)  formulas  for  a  square 
which  are  simple  but  accurate.  The  patch  might  look  like  in  figure  1  below. 
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Figure  1.  A  surface  patch  with  the  corresponding  surface  coordinates,  normal  and  tangent  vectors 


The  normal  and  tangent  vectors  are  chosen  as 

f  =§L/|9r| 

‘  bui 

f  IL/I^I  (2-2) 

|av| 

n  =  ti  xt2 

If  the  mapping  used  is  conformal,  the  result  is  that  the  vectors  in  (2-2)  are  orthogonal.  This  ensures  a  unique 
expansion  if  the  needed  vector- valued  basis  functions  are  based  on  the  vectors  t]  and  ti  -  The  basis  functions  are 
chosen  as  follows: 


I-  =  0,  i  =  1 ...  6  elsewhere 

This  choice  of  basis  is  made  because  these  basis  functions  are  defined  only  on  one  patch,  and  has  no  impact  on 
the  neighbouring  patches.  The  main  benefit  is  that  neighbouring  patches  need  not  lie  comer-to- comer,  which 
means  that  if  an  object  with  different  curvatures  on  different  parts  of  the  surface  is  to  be  generated,  the  different 
surfaces  may  be  generated  separately,  and  joined  without  effort  (see  figure  2  below).  Another  benefit  is  that,  e  g, 
a  spherical  surface  may  be  divided  into  patches  which  are  squares  in  the  0-(p-plane  but  all  have  approximately 
equal  side  length  in  real  space.  This  is  important  as  we  want  to  avoid  having  many  small  patches  on  one  part  of 
the  surface  and  some  large  ones  on  another  part,  which  would  imply  wasting  computational  effort  on  the  small 
patches  while  the  large  ones  give  rise  to  a  large  error. 


i,  -  t,  14  -  t2 

-  uti ,  j  is  =  ut2  on  the  current  patch 
is  =  Vti  is  =  Vt2 
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Circles,  cylinders  and  cones  are  three  types  of  objects  that  can  be  described  with  the  same  parameterization: 


ri+  (r2-ri)  (  — ) 

(P  -  (Pl+  ((p2“9i)  (  — ) 
U  +  1 

Zj+  (Z2-Z1)  (-— ) 


rcos<p! 
r  =  rsincp' 


The  triangular  patch  is  needed  for  the  representation  of  a  pyramid,  for  instance.  Also,  if  there  was  a  triangulation 
available  on  some  complex  object,  it  would  be  nice  if  the  method  used  here  was  able  to  use  it. 

In  the  above  technique  the  patches  are  easy  to  map  conformally  from  the  u-v-plane  to  the  f' -space.  The  mapping 
is  sometimes  non-conformal  at  one  point  of  the  patch,  but  this  is  always  at  an  edge  of  the  patch,  so  it  does  not 
lead  to  a  problem  in  the  above  cases.  A  conformal  mapping  from  a  square  to  a  triangle  is  possible  to  construct, 
but  it  would  be  rather  complicated.  It  would  also  imply  that  the  tangent  vectors  would  vary  over  the  surface,  as 
on  the  curved  patches  above,  which  seems  a  bit  inappropriate  as  the  vectors  on  the  surface  patch  would  be  easily 
expressed  with  two  constant  orthogonal  vectors.  A  simpler  approach  would  be  to  copy  the  expression  (2-5),  but 
to  contract  one  side  to  a  point.  The  result  is 


.  ,  .  .u  +  1.  .  A 

(x^-i-  (X2-Xj)  (-y-))  •  (- 


-  V.  , 1  +  V. 

y)  +Xo-  (  — ) 


yi+  (y2-yi)  (^) 


f  =  Tf'-i-fo 

This  mapping  is  non-conformal  on  the  entire  patch.  However,  this  does  not  impose  a  problem  as  the  needed  nor¬ 
mal  and  tangent  vectors  may  be  chosen  as 

ti  =  Tx'  =  ^/&l  (!) 

ou  I  Bui 

lavl 

h  =  ti  X  t2  =  Tz' 

Note  that  the  choice  of  tangent  vectors  must  not  necessarily  be  the  one  mentioned  earlier.  We  only  need  orthogo¬ 
nal  vectors  tangent  (and  normal)  to  the  patch.  The  basis  functions  are  obtained  from  these  vectors  as  usual.  As  for 
the  surface  element,  the  mapping  is  non-conformal  and  (2-4)  does  not  work.  We  must  use  the  Jacobian  instead, 
and  thus  the  surface  element  is 


dx'  dx' 

9“  Jjydv  = 
3y'  3y' 

3u  3v 


-Xi  i_v  y2-yi 
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3  Solution  of  the  equations  and  calculation  of  RCS 

The  test  functions  are  chosen  to  be  Dirac’s  delta  functions,  and  multiplication  and  integration  leads  to  a  system  of 
integral  equations  which  solves  the  MFIE  and  AMFIE  in  certain  points  at  the  surface.  We  thus  have  a  point 
matching  (or  collocation)  method.  The  integrals  is  numerically  calculated  by  FORTRAN  subroutines,  and  finally, 
the  system  of  equations  is  solved  in  MATLAB.  Some  results  calculated  with  this  method  is  now  presented. 

First  a  simple  spherical  problem  is  chosen,  and  since  the  solution  to  the  problem  of  a  perfectly  conducting  sphere 
can  be  calculated  analytically,  this  solution  will  be  used  as  reference.  In  figure  3  the  MFIE  is  used  on  spheres  with 
100  patches  and  different  types  of  surface  patches.  As  can  be  seen,  it  is  a  great  benefit  to  use  the  exact  surface 
representation.  The  relative  error  is  reduced  with  a  factor  ~30  on  the  interval  (0,3.7).  Outside  this  interval  the 
solutions  are  not  useful,  since  the  relative  error  increases  significantly  at  higher  frequencies. 


One  frequently  used  way  to  ‘repair’  the  damage  done  to  an  object  when  triangulating  is  to  increase  the  size  some¬ 
what  so  that  the  area  of  the  object  is  the  same  in  the  triangulation  as  in  real  space.  The  effect  of  such  a  ‘repara¬ 
tion’  is  also  shown  in  figure  3.  The  relative  error  improves  with  a  factor  --1.6  as  compared  to  the  ordinary 
triangulation.  One  of  the  improvements  of  the  ‘reparation’  is  rather  important,  though;  the  maxima  of  the  RCS 
appear  on  the  correct  frequencies. 


Figure  3.  Comparison  between  triangulation  and  exact  surface  representation 
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In  figure  4  the  MFIE  is  used  on  the  exact  surface  representation.  The  number  of  patches  is  varied,  and  it  is  appar¬ 
ent  that  doubling  the  number  of  patches  decreases  the  relative  error  by  a  factor  ~2.  When  using  200  patches,  the 
relative  error  is  less  than  1  %  as  high  as  k  •  r  »  4.5 ,  except  for  the  resonances  (compare  with  k  •  r  =  271 ,  that  is 
equivalent  to  a  wavelength  equal  to  the  radius  of  the  sphere).  An  upper  limit  to  the  number  of  patches  is  imposed 
by  the  available  memory,  because  the  method  utilizes  full  matrices.  The  hardware  available  for  this  work  set  this 
limit  at  approximately  200  patches.  When  exceeding  this,  a  lot  of  disk  swapping  starts  to  take  place,  which  slows 
down  the  calculations  almost  to  a  dead  stop. 
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Figure  4.  Different  number  of  patches 


54  patches 
- 100  patches 


—  200  patches 

These  resonances  are  now  the  main  problem.  A  way  to  remove  the  spurious  solutions  is  to  implement  the  Aug¬ 
mented  MFIE  [8].  This  is  effective  if  the  object  in  question  does  not  possess  any  rotational  symmetry,  and  then 
all  the  spurious  solutions  are  removed  by  the  AMFIE.  Below  the  result  is  plotted  from  usage  of  MFIE  and 
AMFIE  on  a  cube  with  96  patches.  Unfortunately,  the  solutions  differ  somewhat  because  the  AMFIE  is  more  sen¬ 
sitive  to  certain  errors  introduced  in  the  numerical  model  [1],[8].  Another  drawback  is  that  the  AMFIE  does  not 
remove  all  the  spurious  solutions  if  the  object  possesses  rotational  symmetry. 

Figure  5.  RCS  from  a  cube 


RCSrdB] 


To  illustrate  another  usage  of  the  equation,  a  calculation  has  been  made  on  an  ogive  with  74  patches.  Instead  of 
varying  the  frequency,  the  incidence  angle  is  varied.  The  angle  is  measured  from  the  ‘tip’  of  the  ogive.  Note  the 
low  front  and  back  RCS  seen  in  figure  6. 
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Figure  6.  Calculated  RCS  for  an  ogive 


One  way  to  visualize  the  surface  currents  is  to  plot  the  absolute  value  of  the  currents  on  the  surface  patch.  The 
results  of  some  calculations  are  shown  in  figure  7.  Darker  patches  means  that  less  current  flow  on  the  patch,  but 
the  color  scale  is  only  valid  in  each  diagram  by  itself. 

Figure  7.  Currents  on  a  sphere  and  a  cube. 


4  Conclusions 

We  have  seen  that  the  MFIE  and  the  AMFIE  may  be  solved  rather  accurately  with  this  approach,  especially  when 
compared  to  an  ordinary  triangulation  (see  section  3).  The  method  shows  considerable  accuracy  for  k  values  as 
high  as  4.5,  at  least  for  a  sphere.  A  drawback  is  that  the  matrices  are  full,  so  a  full  LU-  or  QR-factorization  is 
needed,  which  takes  a  large  amount  of  CPU-time.  Also,  memory  requirements  set  the  upper  limit  to  the  number 
of  patches.  The  calculations  are  quite  possible  to  perform,  though,  and  the  calculations  may  be  done  on  worksta¬ 
tion  hardware  (which  is  inexpensive  as  compared  to  mainframe  CPU-time). 

The  less  accurate  AMFIE  method  may  not  seem  to  be  useful,  but  it  can  still  be  used  in  conjunction  with  the  MFIE 
in  a  special  way.  The  MFIE  is  solved,  but  then  the  normal  component  of  the  magnetic  field  is  checked,  and  if  it  is 
nonzero  the  AMFIE  is  solved  instead  to  give  a  solution  where  the  normal  component  is  zero.  This  allows  for 
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usage  of  MFIE  at  frequencies  away  from  the  cavity  eigenfrequencies,  and  so  the  accuracy  of  the  MFIE  is 

retained,  while  most  of  the  spurious  solutions  are  eliminated  by  the  AMEDE. 

Some  closing  suggestions  for  continued  work  may  be  in  order: 

•  The  Galerkin  approach  could  be  utilized  in  the  MFIE  and  AMFIE  equations.  This  would  most  probably 
increase  the  accuracy,  but  at  the  price  of  increased  CPU-time.  One  great  benefit,  though,  is  that  this  does  not 
demand  more  memory,  and  thus  it  is  possible  to  increase  the  accuracy  somewhat  even  when  the  upper  limit  in 
memory  is  reached. 

•  It  has  been  noted  that  the  MFIE  is  not  well  suited  to  solve  thin  plate  problems  [2].  Surfaces  inserted  in  this 
equation  must  be  closed,  but  the  EFIE  can  solve  problems  which  contain  open  surfaces  accurately.  This  means 
that  the  implementing  the  EFIE  allows  for  a  larger  class  of  problems  to  be  solved.  Some  problems  in  imple¬ 
menting  the  EFIE  in  this  approach  has  been  noted;  e  g  the  collocation  method  is  not  possible  to  use  [1]. 

•  Some  work  could  also  be  done  to  implement  the  combined-field  or  the  combined- source  equations.  The  bene¬ 
fit  of  this  would  be  that  all  the  spurious  solutions  would  be  eliminated  [6], [7].  The  same  problems  as  concern 
the  EFIE  are  present  [  1  ] . 

•  Objects  generally  have  sharp  comers,  and  there  is  reason  to  suspect  that  the  currents  change  rapidly  there  (see 
for  instance  figure  7),  which  may  give  rise  to  a  large  error  in  the  current  expansion  on  patches  near  the  edges. 
To  prevent  this,  the  patches  near  the  comers  should  be  chosen  smaller  than  the  other  patches.  This  is  currently 
possible  to  perform  manually,  but  might  be  done  automatically  when  setting  up  a  problem. 
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Abstract 

Infrared  (IR)  measurement  techniques  developed  to  determine  electromagnetic  (EM)  field 
distributions  are  used  to  validate  predictions  made  by  Computational  Electromagnetic  (CEM) 
simulation  tools  such  as  the  Gener^  Electromagnetic  Model  for  the  Analysis  of  Complex  Systems 
(GEMACS)  code.  This  paper  provides  comparisons  between  measured  and  simulated  results  from 
various  scattering  experiments  to  demonstrate  the  feasibility  and  illustrate  the  application  of  the  IR 
measurement  technique  as  validation  technique  for  CEM  tools. 

A  horn  antenna  was  used  to  generate  the  fields  during  the  EM  scattering  experiments.  These 
experiments  were  performed  in  the  Rome  Laboratory  anechoic  chamber  to  prevent  unintentional  EM 
interference  fi-om  distorting  the  measurement  results.  An  IR  detector  material  is  placed  near  the  test 
object  to  absorb  a  portion  of  the  EM  field.  The  resulting  temperature  distribution  on  the  IR  detector 
is  measured  with  an  IR  measurement  system.  The  temperature  distribution  is  directly  related  to  the 
intensity  of  the  scattered  EM  fields. 

IR  Background 

The  IR  measurement  technique  is  a  minimally  perturbing  measurement  technique  used  to  map 
EM  fields  and  provide  a  visual  two  dimensional  representation  EM  field  distributions.  This  technique 
is  based  on  the  thermal  heating  which  occurs  when  EM  energy  is  absorbed  by  a  lossy  dielectric  or 
resistive  detector  material.  The  absorbed  EM  energy  is  converted  into  conducted  and  convected  heat 
energy.  The  absorbed  energy  is  then  re-radiated  as  EM  energy  that  is  concentrated  in  the  IR  band. 

This  technique  is  employed  as  a  diagnostic  tool  to  map  the  EM  fields  with  and  without  the  presence 
of  measurement  probes  [1].  A  sheet  of  carbon  loaded  Kapton  with  a  resistivity  of  100  ohms/square 
is  used  as  the  IR  detector  material.  It  absorbs  a  small  portion  of  the  EM  energy  which  causes  the 
surface  temperature  of  the  detector  to  rise  above  the  ambient  temperature  which  can  be  detected  by 
an  IR  measurement  system.  The  absorbed  power  in  a  volume  is  a  function  of  both  the  electric  and 
magnetic  fields  as  shown  in  the  following  equation: 

P.b.  =  J  o)^"H?)dv  (1) 
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where  a  is  the  conductivity  of  the  detector,  e*'  is  the  imaginary  permittivity  of  the  detector,  p'*  is  the 
imaginary  permeability  of  the  detector  and  o)  is  the  angular  frequency  of  the  incident  radiation.  A  thin 
lossy  detector  made  of  carbon  loaded  Kapton  is  placed  in  the  area  where  the  two-dimensional  map 
of  the  EM  field  desired.  The  conductivity  and  the  imaginary  components  of  the  permittivity  and 
permeability  cause  the  temperature  of  the  detector  to  rise  above  the  ambient  temperature.  The 
increase  in  temperature  is  proportional  to  the  local  electric  and  magnetic  field  intensity  at  every 
location  on  the  detector  [2],  Thus,  a  two-dimensional  map  of  the  field  intensity  is  produced.  The 
absolute  temperature  across  the  IR  detector  is  recorded,  digitized  and  stored  using  an  Agema 
Thermovision  infrared  measurement  system.  This  data  can  be  used  to  directly  obtain  the  relative  field 
intensity  or  absolute  field  intensity  through  a  temperature  calibration  procedure.  This  technique  has 
been  applied  to  determine  the  radiated  fields  from  antennas  in  both  the  near  and  far  field,  intensity  of 
EM  fields  coupled  through  aperture  in  a  shielded  enclosure,  and  modal  distributions  of  EM  fields 
excited  inside  metal  cavities  [3].  Another  application  of  the  IR  measurement  technique  will  be 
presented  in  this  paper  is  the  validation  of  CEM  analysis  tools  and  models.  Minimally  perturbing  IR 
measurements  wiU  be  directly  compared  to  the  CEM  simulated  data,  providing  insight  in  the  area  of 
model  development,  model  enhancement  and  simulation  techniques. 

IR  Measurement  Procedure 

Validation  of  the  CEM  predictions  were  accomplished  by  performing  IR  measurements  in 
Rome  Laboratory's  anechoic  chamber.  The  anechoic  chamber  is  used  to  minimize  unintentional  EM 
fields  and  EM  reflections  from  distorting  the  measurement  results  and  to  provide  a  stable  temperature 
environment.  The  measurement  technique  involves  placing  the  IR  detector  (carbon  loaded  Kapton 
sheet)  in  front  of  the  EM  radiating  source.  A  1-2  GHz  standard  gain  horn  is  used  to  generate  the  EM 
field.  The  physical  dimensions  of  the  horn  were  measured  to  allow  the  horn  to  be  properly  modeled 
for  the  CEM  simulations  and  to  determine  how  far  the  IR  detector  must  be  placed  from  the  horn  in 
order  to  be  in  the  far  field.  The  far  field  condition  is  determined  by  using  the  following  equation: 

Far  Field  =  2DVA.  (2) 

Where  (D)  is  the  largest  dimension  of  the  horn  antenna  (D  =  .5 1  meters)  and  X  is  the  wavelength  of 
the  transmitted  frequency,  at  IGHz,  the  calculated  far  field  is  1.7  meters.  The  IR  detector  is  placed 
on  a  styrofoam  block  to  minimize  EM  reflections  and  it  is  positioned  directly  in  front  of  and  at  various 
distances  from  the  face  of  the  horn  antenna.  A  long  wave  IR  scanner  has  a  spectral  response  of  8-12 
microns  and  can  detect  temperature  changes  as  small  as  .08  °C.  The  scanner  is  placed  on  a^s  with 
the  horn  antenna  and  IR  detector  as  shown  in  figure  1.  The  IR  scanner  samples  data  at  136  lines  per 
frame  and  272  samples  per  line,  the  data  is  digitized  and  stored.  A  synthesized  sweep  generator  is 
used  to  generate  the  transmit  signal,  it  is  pre-amplified,  then  amplified  by  a  200  Watt  TWT  Amplifier. 
The  output  of  the  TWT  is  connected  to  a  directional  coupler,  the  output  of  the  coupler  is  connected 
to  the  antenna.  The  coupler  samples  the  forward  and  reflected  power  and  is  displayed  by  a  power 
meter,  this  allows  the  power  transmitted  by  the  antenna  to  be  determined  and  the  power  density  at 
the  IR  detector  to  be  calculated.  The  IR  measurements  begin  by  measuring  the  background 
temperature  over  the  IR  detector  without  an  EM  field  applied.  The  background  temperature 
distribution  is  stored  in  memory  and  is  subtracted  from  the  temperature  distribution  of  the  EM  signal. 
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Computational  Electromagnetics  (CEM)  Tool 

The  General  Electromagnetic  Model  for  the  Analysis  of  Complex  System  (GEMACS)  version 
5.3  is  used  to  demonstrate  the  feasibility  of  the  IR  measurement  technique  in  the  validation  of  CEM 
codes  and  models.  GEMACS  is  a  robust  CEM  tool  that  employs  multiple  solution  techniques  to 
determine  various  electromagnetic  observables  at  high  and  low  frequencies,  interior  and  exterior  to 
scattering  objects.  These  observables  include  antenna  patterns,  field  distributions,  current  distributions 
on  structures,  and  EMI/EMC  transfer  functions  [4]. 

At  Rome  Laboratory,  GEMACS  is  primarily  used  for  the  "mounted  antenna"  problem. 
Antennas  tend  to  perform  to  specification  in  "free-space"  but  can  perform  poorly  when  mounted  on 
their  host  platform.  The  antenna/system  performance  can  be  degraded  due  to  platform  scattering  or 
even  antenna-to-antenna  coupling.  In  order  to  accurately  characterize  the  performance  of  these 
systems,  it  is  essential  for  the  CEM  tool's  computations  to  be  valid  and  also  for  the  antenna  and 
platform  models  to  be  constructed  very  carefully. 

Simulations 

GEMACS  is  used  to  predict  the  EM  field  distribution  resulting  from  a  horn  antenna  in  free- 
space  and  in  the  presence  of  a  cylindrical  scatterer  as  previously  described.  GEMACS  and  the  user 
must,  therefore,  model  the  antenna,  the  cylinder  and  the  EM  interactions  accurately. 

GEMACS  has  a  number  of  source  modeling  options  that  can  be  categorized  as  either  "stiff' 
or  "soft"  [4].  A  source  having  invariant  radiating  characteristics  is  called  a  "stiff  source  while  the 
radiating  characteristics  of  a  "soft"  source  are  influenced  by  the  presence  of  other  sources  and  the 
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host  platform.  Both  types  of  sources  have  been  applied  to  the  established  scenario. 

Data  Comparison 

In  the  following  section  the  GEMACS -simulations  results  and  the  IR  measurement  results  are 
represented  as  gray  scale  images,  areas  of  high  field  levels  and  high  temperatures  are  white  and  the  lowest 
field  levels  and  temperatures  are  black.  The  first  set  of  simulations  involves  the  use  of  the  GEMACS  PSRC 
command  [5].  One  feature  of  the  PSRC  command  analytically  represents  a  far-field  horn  antenna  based 
upon  user  supplied  geometric  and  electrical  information.  This  is  a  "stiff'  source  which  is,  therefore,  not 
affected  by  near  field  scatterers.  The  fi'equency  of  this  horn  model  is  1  GHz  and  the  field  data  is  computed 
at  2  meters  fi'om  the  face  of  the  horn.  The  plane  of  free-space  field  data  produced  by  this  model  is  shown 
in  figure  2.  The  IR  measurement  of  the  free-space  field  distribution  is  shown  in  figure  3.  The  measured 
and  simulation  data  compare  very  well,  the  number  of  rings  is  the  same  as  is  the  relative  field  intensity.  The 
measured  data  is  somewhat  noisy  and  is  not  as  defined  as  the  simulation  data,  this  is  due  to  the  power  level 
available  at  2  meters.  At  a  higher  power  level  or  closer  distance  to  the  source  the  IR  measured  data  would 
be  more  well  defined.  The  cylindrical  obstruction,  modeled  by  the  Geometric  Theory  of  Diffraction  (GTD) 
capability  in  GEMACS,  was  then  introduced  into  the  simulation.  This  cylinder  is  positioned  longitudinally 
in  jfront  of  the  source  centered  about  the  2  meter  distance,  this  data  is  shown  in  figure  4.  Note  that  only 
half  of  the  measurement  plane  is  simulated  to  reduce  computation  time.  The  correlation  between  the 
simulations  and  the  measurements  indicate  that  both  the  user  generated  model  and  the  GEMACS 
computations  are  valid  within  the  constraints  of  this  particular  scenario.  The  IR  measurement  of  the 
cylinder  positioned  longitudinally  in  front  of  the  source  is  shown  in  figure  5.  Again  the  IR  measured  data 
compares  favorably  with  the  simulation  data,  the  structure  is  similar  with  the  simulation  data  being  more 
defined.  The  hot  spot  on  the  side  of  the  cylinder  in  the  simulation  data  is  due  to  discontinuities  at  the 
boundary  of  the  cylinder.  The  next  simulations  address  the  use  of  "stiff'  sources  in  the  presence  of  near¬ 
field  scatterers.  A  2  GHz  PSRC  horn  source  is  positioned  0.5  meter  fi'om  of  a  GTD  cylinder  placed 
sideways.  The  data  is  computed  fi'om  the  source  to  the  cylinder  as  shown  in  figure  6.  The  data  doesn't 
compare  very  well  with  the  IR  measurements  shown  in  figure  7.  This  indicates  that  the  PSRC  horn  model 
is  not  appropriate  for  use  in  this  situation.  Next,  a  2  GHjz  spherical  wave  [4,6]  "stiff"  source  is  applied  to 
the  same  configuration  and  the  data  is  shown  in  figure  8,  and  compares  well  with  the  IR  measured  results 
in  figure  7.  Therefore,  the  IR  measurements  helped  to  identify  that  the  spherical  wave  source  should  be 
used  to  represent  a  horn  antenna  in  the  presence  of  a  small  near-field  object.  The  IR  measurement 
technique  h^  also  been  used  in  the  validation  a  Method  of  Moments  (MOM)  patch  model  of  a  1  GHz  horn 
antenna.  MOM  antenna  models  are  "soft"  sources  which,  therefore,  accounts  for  the  presence  of  near  field 
scatterers  like  those  found  on  aircraft  and  ships.  The  results  of  this  simulation  are  plotted  in  figure  9  and 
show  a  distinct  lack  of  agreement  between  the  simulation  and  the  IR  measurements  shown  in  figure  3 .  This 
model  is  therefore  not  valid,  probably  due  to  asymmetry  in  the  patching  of  the  horn  model. 

Conclusions 

IR  measurements  were  used  to  map  the  electromagnetic  field  distributions  generated  by  a  standard 
gain  horn  and  the  fields  scattered  by  a  metal  cylinder  for  both  near  and  far  field  conditions.  Each 
measurement  scenario  was  modeled  and  simulated  using  the  GEMACS  code  to  provide  a  direct  comparison 
with  the  measured  results.  The  measured  IR  data  compared  favorably  to  the  simulation  data,  the  general 
shape  and  relative  levels  of  the  radiation  patterns  are  in  close  agreement.  The  IR  measurement  technique 
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provides  a  quick  and  accurate  representation  of  antenna  patterns  and  electromagnetic  scattering  effects. 
The  IR  measurement  technique  can  also  be  used  to  identify  potential  problems  in  the  development  of 
antenna  models  and  simulation  techniques.  Thus  far  only  simple  geometric  shaped  scatterers  have  been 
considered,  however  as  more  complex  problems  are  considered  the  complexity  of  the  simulation  problem 
and  the  simulation  run  time  will  also  increases.  This  paper  has  demonstrated  the  feasibility  of  using  IR 
measurements  to  validate,  enhance  and  improve  the  performance  of  CEM  tools. 
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Figure  2 :  Horn  Free  Space  Simulation  Data  Figure  3 :  Horn  Free  Space  IR  Data 


Figure  4:  1/2  Cylinder  Simulation  Data  Figure  5;  IR  Data  of  Cylinder 

End  View 
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Abstract 

This  paper  analyses  the  EMI  at  an  ocean  observation  post.  Detailed  calculation  in  this  paper 
includes  five  aspects:a)  near-field  gain  of  paraboloid  antenna,  b)  gain  of  communication  antenna 
array, c)  coupling  prediction  between  communication  antennas  of  station  B  and  D,d)  EMI  of  com¬ 
munication  station  B  caused  by  radar  station  A,  and  e)  EMI  of  communication  station  D  caused 
by  radar  station  A. In  practice, some  arguments  we  calculated  are  also  measured  and  given  in  this 
paper.  We  find  that  the  calculated  values  are  very  close  to  the  measured.  The  object  of  above 
calculation  Is  to  analyse  EMI  of  communication  transceiver  which  would  happen  caused  by  radar 
and  to  provide  the  basis  for  EMC  design  of  the  Information  Transmitting  System. 

1.  Introduction 

It's  well  known  that  digital  microwave  circuits  should  stay  at  a  distance  as  far  as  possible  from 
radar  (>10  km).  But  at  the  ocean  observation  post  we  studied,  radar  and  communication  stations 
are  all  be  located  at  the  same  position  in  order  to  transfer  radar  information  conveniently.This  si¬ 
tuation  makes  it  difficult  to  conduct  EMC  design  of  Information  Transmitting  Systems.  There  are 
three  aspects  of  the  interference  of  microwave  communication  systems  caused  by  radaria)  freq¬ 
uency  spectrum  of  radar  signal, b)  parasitic  oscillation,  and  c)  wide-bands'  noise  of  radar's  output 
signal.  Whether  it  can  really  cause  interference  is  up  to  the  coupling  situation  between  radar  an¬ 
tenna  and  communication  antenna. Of  course, there  is  not  only  field  coupling  but  also  connection 
of  signal  lines  at  the  same  post, so  conduction  interference  exits  too.  But  this  problem  can  be  so¬ 
lved  by  means  of  co-site  grounding  design  and  necessary  wave  filtering,  thus, conduction  Interfe¬ 
rence  is  not  discussed  in  this  paper. 

We  know  that  the  directional  function  of  radar  antenna  only  gives  the  far-field  EM  distribution, 
but  near-field  EM  distribution  also  exits  because  of  the  short  distance  between  radar  and  com¬ 
munication  station.  The  arrangement  of  the  observation  post  we  studied  is  shown  as  Fig.1. 


Figure  -1 .  Layout  of  the  observation  post  we  studied 
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In  this  small  post, there  are  three  radars  and  two  microwave  relay  stations.Each  station  is  mark¬ 
ed  with  alphabets  from  A  to  F,  where  D  Is  a  remote  microwave  relay  station. 

Because  radar  E,  and  F  all  work  in  the  SM  band,  the  interference  with  communication  can  be 
ignored  when  discussing  coupling  between  antennas. 

The  EMI  characteristics  of  communication  station  B,C,and  D  caused  by  radar  A  are  as  follows: 

a) Low-frequency  partition  interference  to  communication  stations,  which  can  be  obtained 
by  means  of  Fourier  Analysis. 

b) Because  the  distances,A  to  B  and  A  to  C,are  all  about  35m, near-field  directional  func¬ 
tion  must  first  be  calculated  before  calculating  near-field  antenna  gain. 

c) The  distance  between  radar  station  A  and  communication  station  D  is  50km.  the  angle 
between  two  electric  axles  is  0.5°.  It  belongs  to  far-field  interference,  that  Is  the  coupl¬ 
ing  problem  between  wave  beams,  the  angle  of  which  is  0.5®. 

The  procedure  of  calculating  the  interference  of  B  and  D  caused  by  A  is  as  follows: 

a) Calculate  the  flux  of  interference  power  density  respectively  at  B  and  D  produced  by 
radiation  field  of  radar. 

b) Calculate  the  signal  power  of  antenna  A  at  station  B  and  that  of  antenna  B  at  station  A. 

c) Compare  the  Intensity  of  signal  and  intensity  of  interference  respectively  at  B  and  D  to 
obtain  the  interference  of  communication  B  and  D  caused  by  radar  A  . 

We  know  that  EMC  condition  is  defined  as  formula  (1): 

PA=Pt+Gt+Gr-Lj-Lf-Le  (1) 

P;^:  the  utilizable  power  (received  through  antenna)  (dbmw). 

P^:  the  power  of  transmitter  ( peak  value  of  radar  power)  (dbmw) 

G^:  antenna  gain  of  transmitter  (db) 

Gp  antenna  gain  of  receiver  (db) 

L,:  wave  propagation  loss  (db) 

Lf:  transmission  loss  of  feed  line  from  antenna  to  receiver  (db) 

Lq.  polarization  isolation  loss  of  receiver  antenna  (db) 

Formula  (2)  defines  the  condition  of  interference: 

PA-Pr  >=  0  (2) 

Pp  sensitivity  of  receiver  (dbmw) 

2.  Caculation  of  Near-field  Gain  of  Paraboloid  Antenna 
According  to  the  principles  of  antenna,  the  field  distribution  can  be  seperated  into  three  areas; 
Rayleigh,  Fresnel  Approximation,  and  Frunhofer  Approximation,  which  is  shown  as  Fig.2. 


Figure-2.  Field  integral  appoximations  for  unfocused  apertures 
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Antenna  of  radar  A  keeps  at  a  distance  of  35m  from  communication  antenna.  Together  with 
the  technical  specification  of  the  antenna  of  radar  A,  we  get:Reflector  calibre  D  =  15.5  m,Wave 
band  X  =  0.234  m, Distance  R  =35  m.Thus,R/D  =  2.26,  D/  X  =  66,  therefore,  the  field  we  studied 
falls  into  Fresnel  Approximation.  We  know  that  distribution  function  is: 

F(  ^  ,  T1 , 0)=A(  C  .  Ti ,  0)exp[-jkcp  (  C  ,  Ti ,  0)]  (3) 

where,  A:  Amplitude,  k=2  Jt  /  X  ,  qp :  Phase  . 

At  any  point  P(x,y,z),  the  field  of  scalar  quantity  caused  by  above  distribution  function  is: 

^(P)=(1/4  n  )  j  sF(  ^  ,  n  .  0)  {  [exp(  -jkr)]  /  r }  X 

X[  (jk+1/r)cosx  +  jk9  cp  /3  n  -(1/A)9  A/9  n]ds  (4) 

In  Fresnel  Appoximation  area,  the  amplitude  factor  of  Integral  function  in  formula  (4)  can  be 
made  the  following  appoximation:  1/r«k,  1/r  ^1/R,  and  cosx^cos  9  .  Generally,  amplitude 
A(  ^  ,  T) ,  0)  is  a  moderate  function  of  distance,  thus,  9  A/9  n^O.  So,  formula  (4)  can  be  made 
the  following  approximation: 

W(P)  =jexp(-jkz)/2  X  R  J  sF(  4  ,  -n  ,  0)exp{-jk[(x-  i  )2/2z  -  (y-  n  )2/2zl}X 

X(cos  9+9  cp/9  n)d  ^  d  Ti  (5) 

To  radar  A,  the  antenna  of  which  is  paraboloid,  we  can  obtain:  9^/^n  =  9cp/9z  =  1, 
F(  t ,  T1 . 0)  =  F^  (  ^  )F2(  ) ,  we  get: 

'i'(P)  =j(1+cos  0  )exp(-jkz)/2  X  Rj  5(1^1  (  ^  )exp{-jk[(x-  4  )2/2z]}d  t,  X 

X  J  svF2(  ’I  )exp{-jk((y-  p  )2/2zl}d  n  (6) 

The  deduction  procedure  is  omitted  here,  we  obtain: 

W’'=(n  2/4)  {  [cos(L It  /2)[C(1/2L1/2  .  l1/2)+c(1/2L''/2  + 

+sin(L  II  /2)[S(1/2L1/2  .  l1/2)+S(1/2L''''2  +  0'^^ 

+[sin(L  n  /2)[C(1/2L1/2  .  L''/2)+C(1/2L’'''2  +  0'^ 

+cos(L  Ji  /2)[S(1/2L''/2  -  L'I/2)+S(1/2L'</2  +  L''/2)]2}  (7) 

where,  L=R/(2d2/  X ),  C  and  S  are  Fresnel  integrals,  D  Is  the  horizontal  calibre  of  antenna,  X  is 
operation  wavebandth,  R  is  the  distance  between  antennas.  is  the  conjugate  complex  of  W. 
The  degradation  of  antenna  gain  in  the  near-field  yields:G(j=Go^^*,  where  Gq  is  the  antenna 
gain  in  far-field.  Fig. 3  shows  the  relationship  between  Gj  and  L. 


0.01  0.02  0.03  0.05  0.1  0.2  0.3  0.5  1.0 

Figure-3  Relationship  between  L  and 

To  radar  A,  L  =  35m  /  [2X(15.5m)2  /  0.234m]  =  0.017,  thus,  Gj  =  10.8db  according  to  Fig. 3. 
Therefore  the  Interference  yields:  38db  -  10.8db  =  27.2db. 
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3.  Calculation  of  the  Gain  of  Communication  Antenna  Array 

Four-Yagi-antenna  array  is  used  In  each  communication  station,  which  is  shown  as  Figure-4. 


(a)  (b) 

A:  Passive  reflection  oscillator,  B;  Active  emission  oscillator 
C:  Guide  oscillator  D;  Feeding  cable  S;  Reflection  net 
d-j,  d2:  Distance  between  two  neighbour  Yagi  antennas 
Figure~4.  (a)  Structure  of  Yagi  antenna 

(b)  Disposition  of  antenna  array 

Generally, guide  oscillators  are  apart  X  M  from  each  other, the  distance  between  reflection  osci¬ 
llator  and  active  oscillator  is  also  X  /4. Because  the  antenna  is  more  than  10m  away  from  ground, 
effect  on  radiation  caused  by  ground  can  be  ignored.  The  directional  function  is: 

6  ,  )={cos[(  n  /2)sin  9  cos  cp  ]/cos  9  }X 

X{sin[{kl/2)(1-cos  9  cosqp)]/sin[(kd/2)(1-cos  9  cosqp)]} 

X2cos[(kd  1  /2)cos  9  sin  ]X2cos[(kd2  sin  9  cos  qp )]  (8) 

where,  d(=  X  /4)  is  the  distance  between  two  guide  oscillators,  k=2  /  X  ,l=nd  (n  is  the  number  of 
oscillators).  The  approximate  maximum  gain  of  Yagi  antenna  array  is  Grnax“  (MndC)/  X  ,  where 
M  is  the  number  of  Yagi  antennas,  here  l\/I=4.  C  is  a  factor  related  to  n,  here  n=6,  C=7.8,  thus, 
^max  =I4X6X(  X  /4)X7.8]  /  X  =46.8,  10lg46.8^  17db.  The  gain  of  communication  antenna  array 
is: 

G(  e  ,  <P  )=Gn,ax[f(  ^  ^  )%ax(  ^  o.  >?>  o)J^-  (9) 

The  angle  between  electric  axles  of  radar  A  and  communication  antenna  D  is  0.5°,  according 
to  formula  (8),  fmax(  ^  o*  ^  o)“32,  f(  9  =0.01 ,  cp  =0.5O)^  32,  thus,  the  gain  of  communication  an¬ 
tenna  array  is  G(  9  =0,  cp  =0.5°)  ^  17db. 

4.  EMI  Analysis 

In  this  section,  we  will  concretely  calculate  the  EMI  of  communication  station  B  and  D  caused 
by  radar  station  A.  We  have  known  the  following  conditions: 

.  Location  relationship: 

A  is  35m  away  from  B  (distance  between  antennas)  and  at  the  same  altitude. 

B  is  50Km  away  from  D. 

A  is  50  Km  away  from  D  and  azimuth  ^  0.5°. 

.  Communication  frequency  and  power: 

Frequency  ranges  from  610~780MHz  (up  line:  B  to  D),  780'-960MHz  (down  line:  D  to  B), 
here  we  use  680MHz  (up  line)  and  780MHz  (down  line). 

Communication  power  is  5w. 

.  Radar  parameters: 

Paraboloid  calibre  is  d(widerside)Xh(narrower  side)=15.5mX5.4m, 

Maximum  gain  in  the  far-field  is  Gmax“^®^*^  (measured). 

Operation  frequency  is  fQ=1280MHz. 

Repeat  frequency  is  1/T  =  200Hz, 

Pulse  width  is  x  =4  p  s. 

Peak  power  is  P^  =  2X450w  =  900w. 
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4. 1  Coupling  prediction  between  communication  antennas  of  station  B  and  D 

Here  we  only  analyse  the  down  line  communication  (D  is  emitter, B  is  receiver),  we  know  that: 
Pt=1 0lg5X1 03=36.99dbmw, 

Gt=18db  (measured), 

G^=18db  (measured), 

Lf=3db. 

Le=0db. 

Lj=20lg(4:r  R/X)=20!g(4tr  Rf/3X10-^)  db  =  32.44+20lgf(MHz)+20lgR(Km) 

=  32.44+20lg780MHz+20lg50Km  =124.26  db.  Thus: 

Pa=  P^+G^+G,- Lj-  Lf-  Le=  -  54.27  dbmw 

The  sensitivity  of  receiver  P,.=2  V,  if  Input  resistance  is  75  ,  then,  corresponding  power 

can  be  obtained: 

Pdbmw='*  0-6V  \i  v)2/R]+30db  =  -1 20db+20lgV  ^  - 1 0lgR+30db  =  -1 02.73dbmw, 
therefore,  P/^-Pr=  -  54.27  -  (-102.73)  =  48.46dbmw. 

That’s  to  say  the  signal  power  is  48.46dbmw  above  the  sensitivity  of  receiver. 

4.2  EMI  of  communication  station  B  caused  by  radar  station  A 

Assume  that  B  is  receiver,  communication  frequency  f-i=780MHz(down  line) .According  to  the 
specification  of  radar  A,  we  get  the  power  of  transmitter  is: 

Pt=p\2  T  /  T  (fi-y  T  ]2  =  5.836X10‘5mw,  thus, 

Pt(dbmw)='IOIgPt=10lg5.836X10-5=  -42.34dbmw. 

Generally,  directional  function  of  rectangle  paraboloid  antenna  is  D=1 0.0531  (dXh)/x  2,  we  get: 
D=1 0.0531X1 5.5mX5.4m/(0.234m)2=1 5362.43,  thus, 

D(^tj=101gD=41 .86  db.  The  efficiency  of  radar  A  is  p  =60%,  so  the  gain  of  radar  antenna 
is  G=DX  p  ,  thus, 

Gdb=41 .86db  -2.2db=39.66db. 

In  fact,  the  measured  value  of  It  is  38  db.  Degradation  of  gain  (G(i=10.8)  in  near-field  (35m)  has 
been  deducted  at  the  end  of  section  2,  thus, 

Gr38-10.8=27.2db. 

The  angle  between  electric  axles  of  A(radar  antenna)  and  B(communication  antenna)  is  about 
135°.  We  get  the  values: 

Gr^  -lOdb, 

Lg^Odb, 

Lp3db, 

R=35m=3.5X10-4Km, 
f=  780MHz, 

Lj=20lg(4jrR/x)=20lg(4:rRf/3X10-'’)db=32.44+20lgf(MHz)+20lgR(Km)  =  59.82db,  thus. 
Pa  =  Pt  +  Gt  +  Gr-  Lj  -  Lf-  Lg  =  -  87.96  dbmw 
PA-Pr=  '  87.96  -  (-102.73)  =  14.77dbmw. 

That's  to  say  the  Interference  is  14.77dbmw  above  the  sensitivity  of  receiver.But  in  section  4,1 , 
we  get  the  signal  intensity  of  B  received  from  D  Is  48.46dbmw  above  the  sensitivity, so  the  signal 
power  is  33.69  dbmw  above  interference  power.  Although  considering  the  signal  degradation 
( 30db ),  the  signal  power  is  3.69db  above  interference  power.  We  obtain  the  result  that  radar  A 
doesn't  interfere  with  communication  B, 

4.3  EM!  of  communication  station  D  caused  bv  radar  station  A 

Assume  that  D  is  receiver,  communication  frequency  fi=  680MHz  (up  line).  According  to  the 
specification  of  radar  A,  we  get  the  power  of  transmitter  is: 

Pt=P"(2  T  /  T  )2(1/ jt  (f.,-y  T  ]2  =  4.053X10-5mw,  thus. 
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Pt(dbmw)=''OlgPt=10lg4.053X10“5=  -43.92dbmw. 

Because  the  distance  between  two  stations  is  50Km,  radar  antenna  gain  in  the  far-field  is 
GpSSdb  (measured).  In  section  4.2,  we  have  obtained  this  value  which  is  39.66  (calculated). 

The  angle  between  two  electric  axles  of  A  and  D  (antennas)  is  0.5°,  thus, 

Gr(  0  =0,cp=0.5°)  ^17dB.  In  fact,  the  measured  value  of  it  is  18db. 

Le^Odb, 

Lf=3db, 

R=50Km, 
f  =  esoiviHz. 

Lj=20Ig(4  n  R/  X  )=20lg(4  :t  Rf/3X10‘'')db=32.44+20lgf(MH2)+20lgR(Km)=123.07db,thus, 
Pa  =  Pt  +  G|  +  Gf  -  Li  -•  Lf  -  Le  =  - 11 3.99  dbmw 
Pa  -  Pr  =  -  113.99  -  (-102.73)  =  -11.26dbmw. 

That's  to  say  the  Interference  is  11.26  dbmw  below  the  sensitivity  of  receiver.  We  obtain  the 
result  that  radar  A  doesn't  interfere  with  communication  D. 

5.  Conclusion 

We  mainly  analysed  the  EMI  of  microwave  communication  station  B  (near-field)  and  station 
D  (far-field)  caused  by  Cm-Band  radar  (station  A)  to  provide  the  scientific  basis  for  EMC  design 
of  an  Information  Transmitting  System.  After  calculation,  we  obtained  the  result  that  no  EMI 
will  happen  in  the  ocean  observation  post  we  studied.  This  is  how  things  stand. 
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Abstract 

This  paper  deals  with  probabilistic  modeling  of  the  EM  response  of  a  complex  cavity  to 
HPM  penetration.  Theoretical  studies  indicate  that  the  field  amplitudes  and  cable  currents 
squared  should  have  a  chi  square  distribution  with  two  degrees  of  freedom,  but  our  observations 
indicate  that  a  log  normal  fit  is  empirically  better  unless  the  data  is  first  passed  through  a  trend- 
removing  filter.  The  major  innovation  reported  here  is  the  development  of  an  algorithm  which 
produces  cable-drive  fields  simultaneously  having  a  chi  square  distribution,  and  the  physically 
mandated  autocorrelation  at  a  point  as  frequency  is  swept  or  at  a  frequency  as  the  field  sensor 
is  moved  around  to  map  the  cavity  response.  This  algorithm  is  general  enough  it  can  also 
transform  (for  instance)  a  chi  square  distribution  into  a  normal  or  log  normal  distribution. 

Statistical  Introduction  and  Empirical  Background 

It  has  been  established  that  the  power  picked  up  by  a  dipole  antenna  in  an  overmoded 
cavity  obeys  a  chi  square  distribution  with  two  degrees  of  freedom.  This  statement  is  true  under 
cw  conditions,  and  applies  whether  the  observations  are  made  at  a  fixed  observation  point  while 
frequency  is  swept  or  whether  the  observations  are  made  at  a  fixed  frequency  while  the  antenna 
is  translated  or  rotated. It  does,  however,  depend  on  the  cavity  not  having  a  frequency- 
dependent  Q  and  on  all  parts  of  the  cavity  being  equally  shielded  (i.e.,  no  subenclosures  within 
the  enclosure).  If  these  postulates  are  not  obeyed,  the  antenna  power  distribution  tends  to  appear 
log  normal  instead."’^ 

In  general,  for  real-world  problems,  conditions  are  not  ideal,  and  the  dipole  antenna 
response  contains  a  "trend",  as  well  as  a  rapidly  varying  response  associated  purely  with  the 
electromagnetic  statistics.  Then  it  becomes  necessary  to  filter  out  this  trend  if  one  wishes  to  see 
the  chi  square  statistical  behavior.^'^  Price,  et  have  produced  a  lengthy  and  thorough 
derivation  of  the  physics  and  statistics  responsible  for  the  chi  square  attribute  of  the  squared  fields 
(power  flux).  However,  there  is  a  simple  approach  which  relies  somewhat  on  intuition  for 
reaching  the  same  conclusion.  Essentially,  at  any  point  in  the  enclosure,  the  field  projected  on 
any  dipole  consists  of  two  components  in  phase  quadrature,  each  of  which  approaches  a  normal 
distribution.  The  power  picked  up  by  a  dipole  antenna  is  proportional  to  the  summed  squares  of 
these  quadrature  components.  (Alternatively,  it  could  be  viewed  that  the  power  picked  up  is  the 
summed  squaes  of  the  waves  going  backwards  and  forwards.)  However,  this  sum  is  exactly  the 


This  work  was  supported  by  the  U.  S.  Air  Force  Phillips  Laboratory  under  Contract  F29601-94- 
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definition  of  a  function  which  has  a  chi  square  distribution  with  two  degrees  of  freedom. 


An  empirical  result  we  have  observed  about  data  pertaining  to  the  power  (current  squared) 
carried  by  a  cable  inside  an  enclosure  is  that  it  also  obeys  a  chi  square  distribution  with  two 
degrees  of  freedom  after  the  power  is  detrended  (see  Figure  1).  (Before  detrending,  cable  powers 
also  tend  to  appear  log  normal  in  distribution.)  This  result  is  not  predicted  or  anticipated  by  the 
Price  work.^  However,  if  the  power  is  viewed  as  the  summed  squares  of  two  phase  quadrature 
currents,  there  is  no  surprise  in  this  observation.  (There  is  an  experimentally  prompted 
assumption  here  that  the  unsquared  phase  quadrature  current  components  also  have  a  normal 
distribution,  however.) 

In  particular,  if  we  have  a  cable  inside  an  enclosure  carrying  a  sinusoidally  varying  current 
with  u  and  v  the  current  phase  quadrature  components,  the  cable  power  will  be  of  the  form 

z  =jc  +y 

Moreover,  if  z  has  a  power  density  distribution  h(z),  we  can  define 


//(co)  =  J  h(z)e~^^dz 


It  can  be  shown  that  the  density  distribution  h{z)  is  related  to  the  density  distributions  of  x  and 
yj(x)  =/(y)  by 


h(z)  =  ^  f{x)fiz  -x)dx 


so  that/(x)  is  related  to  h{z)  by 


f(x)  C 

/IT  ^ 


In  general,  if  u  and  x  are  related  by  the  transformation  function 


u  ^g(x) 
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g{)  =(f 


(6) 


then  the  distributions  of  u  and  ;c  are  related  by 

e(y)  -Ag-iy))  ^  (7) 

dy 

This  relationship  is  true  if  the  underlying  distribution  of  r  is  chi  square,  log  normal,  or  anything 
else. 


In  the  special  case  of  z  having  a  chi  square  distribution  with  two  degrees  of  freedom, 


h{z)  z>0 

-0  z<0 


(8) 


and  we  find  that/(jc)  can  be  evaluated  in  closed  form, 


fix) 


1 


(9) 


and,  thus,  e(w)  becomes 

e(„)=J_i — -2  (10) 

where  the  factor  of  2  goes  away  if  we  recognize  that,  unlike  z  and  x,  which  range  form  0  to  <», 
u  ranges  from  -oo  to  oo. 


We  thus  see  that,  if  the  cable  power  (current  squared)  obeys  a  chi  square  distribution  with 
two  degrees  of  freedom,  the  phase  quadrature  components  have  a  normal  distribution  with  zero 
mean, 


eiu)  = 


1  g-«V(2<r) 


(11) 


where  the  p  and  a  are  related  by 
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|i  =2a^ 


(12) 


Forcing  of  Autocorrelation 


If  a  cable  is  within  half  a  wavelength  of  a  conducting  wall,  the  drive  per  unit  length  on 
the  cable  is  a  zero-impedance  voltage  source  of  value 


V 


(13) 


where  h  is  the  separation  between  the  wall  and  the  cable  centerline,  and  H,,  is  the  component  of 
the  surface  current  on  the  wall  flowing  parallel  to  the  cable. 

Our  first  attempt  to  simulate  the  chi  square  power  distribution  on  a  cable  was  performed 
by  partitioning  a  cable,  modeled  after  one  in  the  EMPTAC  facility,  into  200  segments  and 
exciting  it  at  1000  different  frequencies.  (Test  data  referenced  here  was  obtained  at  the  Air  Force 
Phillips  Laboratory's  EM  effects  EMPTAC  [EMP  Test  AirCraft] 

facility.  The  EMPTAC  is  essentially  a  gutted  720  airframe.)  The  phase  quadrature  driving 
fields  on  each  segment  were  randomly  chosen  variables  from  a  normal  distribution  with  a  mean 
of  0.00  and  a  variance  of  0.01.  In  particular,  each  segment  was  driven  by  a  field 

Fj  =  [A(a),/7)  cos  cor  +5((0,/7)  sin  cor]  (14) 


where  A(co,n)  and  were  obtained  from 

C(co,w)  =Gu(W,n) 


(15) 


with  a  =  0.01  and  w(co,rt)  obtained  from 

u  (co,  n) 

j  (16) 

^27C  oo 

with  R  a  random  number  uniformly  distributed  from  0  to  1.  In  (16),  the  n  is  the  cable  segment 
index  and  CO  is  frequency. 

Unlike  actual  experimental  data,  this  drive  did  not  result  in  a  cable  power  distribution 
which  was  chi  square,  even  after  filtering  for  trend  removal.  The  problem  was  the  C(co,n) 
generated  in  this  manner  had  no  autocorrelation,  either  between  adjacent  frequencies  or  between 
adjacent  cable  segments.  Thus,  the  resulting  simulated  cable  powers,  unlike  the  experimentally 
observed  cable  powers  (see  Figure  2),  also  had  no  autocorrelation  between  adjacent  frequencies. 
The  first  attempt  to  rectify  this  problem  and  to  introduce  autocorrelation  consisted  of  massaging 
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the  m(co,«)  given  by  (16).  With  R  still  a  random  number  uniformly  distributed  between  0  and  1, 
we  associated  a  11^.^  with  each  R  through  (16). 


We  then  defined  a  new  u  through  a  formula  introducing  spatial  autocorrelation  between 
nearest-neighbor  segments 


u 


space 


M 
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saace  lii 


j-1  space  lim 


—  +/ 
Al 


(17) 


where  Uj,^  is  the  u  for  the  previous  cable  segment,  Al  is  the  cable  segment  length,  and/^^^„  is  a 
number  around  10.  (Making zero  results  in  total  correlation  between  adjacent  segments; 
making  it  >1000  results  in  no  correlation  between  adjacent  segments.)  We  next  defined  a  second 
u  through  a  formula  to  introduce  autocorrelation  between  adjacent  frequencies 


-u. ,  +/,  u 

f-l  Jfreq  sp 


X^ 

LAX 


(18) 


where  is  the  u  for  the  previous  frequency,  L  is  the  linear  dimension  of  the  cable  extent,  AX 
is  the  change  in  wavelength  between  adjacent  frequencies,  and^^^^  is  another  number  around  10. 
(Making  zero  results  in  total  correlation  between  adjacent  frequencies;  making  it  >1000 
results  in  no  correlation  between  adjacent  frequencies.)  The  actual  w(co,/j)  appearing  in  (15)  is 
«,.,of(18). 

This  procedure  succeeds  in  introducing  the  desired  autocorrelation  in  cable  power  over 
position  and  frequency  (compare  Figure  3  with  Figure  2).  Unfortunately,  it  also  succeeds  in 
distorting  the  normal  distribution  that  we  desire  the  array  of  w(a),/2)'s  to  possess,  at  least  at  the 
tails  (see  Figure  4).  Additionally,  the  standard  deviations  c  of  the  driver  ensembles  u((0,n)  and 
v(CD,«)  are  reduced  from  .01  to  about  .003. 

Restoration  of  the  Normal  Distribution 

The  normal  distribution  of  the  u{(0,n)  may  be  realized  along  with  the  desired 
autocorrelation  attribute  by  the  following  process:  Let  us  begin  by  generating  the  two- 
dimensional  ensemble  of  numbers  along  the  spatial  and  frequency  axes  according  to  (15)-(i8). 
(Actually,  we  need  two  ensembles,  one  of  which  will  eventually  become  A(co,«)  of  (14),  and  the 
other  of  which  eventually  will  become  B(co,n).)  Then  pick  a  particular  spatial  point  and  examine 
the  one-dimensional  array  which  characterizes  the  drive  versus  frequency  at  this  point.  To  the 
element  of  this  array  having  the  lowest  value,  assign  the  index  i  =  \;  to  the  next,  assign  /  =  2; 
etc.;  up  to  i  = 
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In  general,  the  elements  of  this  array  (and  even  the  elements  of  the  array  of  indices  i  listed 
versus  frequency)  will  be  in  a  sequence  containing  autocorrelation  information,  but  the  actual 
values  of  the  elements  of  this  array  will  have  been  speciously  altered  away  from  a  normal 
distribution  in  the  process  of  introducing  the  autocorrelation.  We  shall  now  reverse  this  alteration 
without  destroying  the  autocorrelation  by  utilizing  the  concept  that  even  the  array  of  indices  i 
contains  some  residual  autocorrelation  information. 

Let  us  associate  a  fraction  with  each  point  in  the  one-dimensional  array: 

If  this  array  actually  had  a  normal  distribution  with  zero  mean  and  variance,  it  would  be  true 
that,  if  Ui  were  the  value  of  element  /, 

=/;,  (20) 

where  <!>/«)  is  the  cumulative  normal  probability  distribution  function.  Since  this  should  be  true, 
we  shall  force  it  to  be  true:  Rescale  each  m,  to  obey 

u.  =<j,4);'c/;.) 

where  O;*  (f)  is  the  inverse  of  the  normal  cumulative  probability  distribution  function. 

Note  that  this  rescaling  does  not  undo  the  autocorrelation  because  the  z’s  themselves 
contain  some  autocorrelation  along  the  frequency  axis.  (Very  negative  numbers  tend  to  be 
adjacent  to  other  very  negative  numbers,  etc.)  It  is  interesting  to  point  out  that  this  procedure  can 
be  used  to  transform  the  u-s  into  an  array  with  any  desired  distribution,  irrespective  of  what  the 
underlying  initial  distribution  used  to  find  in  (16)  was.  For  instance,  if  ()  of  (21)  is  the 
inverse  chi  square  or  log  normal  cumulative  distribution  function,  the  u-s  emerging  from  (21)  will 
become  chi  square  or  log  normal  in  distribution. 

The  following  observations  now  seem  in  order: 

1.  This  rescaling  procedure  must,  or  course,  be  repeated  for  the  ID  array  at  each  of  the  spatial 
drive  points. 

2.  Autocorrelation  also  existed  along  the  spatial  axis  before  rescaling  by  virtue  of  (17)  and  (18). 
Since  we  merely  rescale,  not  resequence,  this  rescaling  will  also  preserve  this  autocorrelation 
along  the  spatial  axis. 

3.  If  we  impose  a  rescaled  normal  distribution  on  the  u-s  along  the  frequency  axis,  and  if  this 
rescaled  distribution  is  obtained  in  the  same  manner  for  each  spatial  point,  the  distribution  of 
a  ID  array  of  taken  in  the  other  direction  (frequency  constant,  spatial  position  variable)  will 
also  approach  a  normal  distribution  as  the  number  of  spatial  and  frequency  observation  points 
both  become  large. 

4.  In  this  entire  operation  of  rescaling  the  2D  ensemble,  there  is  no  reason  we  cannot  reverse 
the  roles  of  the  frequency  and  the  spatial  axes  across  the  ensemble. 

5.  Even  in  the  problem  is  extended  to  three  spatial  (and  one  frequency)  dimensions,  rescaling 
needs  only  to  be  done  in  one  direction. 
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Now  let  us  backup  some  of  these  ideas  with  results.  A  normal  random-distribution 
generating  process  set  to  give  0.00  mean  and  0.01  standard  deviation  was  used  to  initialize  all 
the  ensembles  which  follow.  Likewise,  and  were  always  set  to  10,  (which  gives  a  fair 
approximation  to  the  autocorrelation  actually  observed  in  the  EMPTAC  test  data  we  are  trying 
to  mimic;  see  Figures  2  and  3.)  Figure  5  illustrates  the  probability  plot  for  ^  being  chi 

square  in  distribution  when  the  ii  and  v  ensembles  are  created  with  autocorrelation  using  (16)-- 
(18).  On  this  probability  plot,  we  overlay  the  result  before  and  after  rescaling.  It  is  apparent  that 
the  z  ensemble  based  on  rescaled  u  and  v  is  nearly  chi  square,  but  the  z  ensemble  based  on  u  and 
V  before  rescaling  is  not.  Figure  6  illustrates  how  far  outside  the  Kolmogorov-Smimov  (K-S) 
90%  confidence  limits  for  being  chi  square  with  a  =  .01  (p  =  .0002)  the  z  ensemble  is  before 
rescaling,  and  how  close  it  is  after  rescaling.  (The  rescaled  ensemble  very  nearly  passes  the  K-S 
90%  test.) 


The  above  results  give  the  impression  that  rescaling  pretty  well  fixes  the  u  and  v  driver 
ensembles.  Consequently,  we  next  examined  the  real  and  imaginary  distributions  of  the  phase 
quadrature  components  of  the  simulated  EMPTAC  cable  currents.  Figure  7  shows  that  the  cable 
current  quadrature  components  do  not  fit  a  normal  probability  plot.  Figure  8  is  a  Kolmogorov- 
Smimov  90%  confidence  test  of  the  cable  power  distribution  (after  filtering  to  remove  the  trend). 
As  anticipated  from  Figure  7,  the  cable  power  distribution  does  not  pass  the  Kolmogorov- 
Smimov  test  for  being  chi  square,  even  after  being  passed  through  a  trend-removing  filter.  Figure 
9  shows,  however,  that  the  unfiltered  cable  power  distribution  is  excellently  matched  by  a  log 
normal  probability  plot. 


At  this  point,  we  considered  the  fact  that  we  had  not  introduced  any  mechanism  to  put 
the  cable  current  into  thermodynamic  equilibrium  with  the  electromagnetic  environment.  To 
accomplish  this,  we  computed  the  radiation  resistance  per  unit  length  of  a  cable  of  radius  a,  with 
centerline  h  over  a  ground  plane: 


R 


•  0  2 


(22) 


The  EMPTAC  tests  we  are  trying  to  match  were  performed  with  frequency  swept  from  100  MHz 
to  1  GHz.  At  317  MHz,  the  above  formula  leads  to  a  radiation  resistance  of  120  Q/m. 
Consequently,  this  resistance  was  next  inserted  in  our  cable  model.  (Part  of  the  justification  for 
doing  this  was  the  knowledge  that  an  actual  cable  at  some  fixed  observation  point  only  responds 
[correlates]  to  the  driving  field  over  a  range  of  a  wavelength  or  two.)  Figure  10  shows  a 
considerable  improvement  results  for  the  cable  current  quadrature  components  matching  a  normal 
distribution  (compare  with  Figure  7).  Also,  the  cable  power  distribution  with  trend  removed  is 
now  much  closer  to  being  chi  square  (compare  Figure  11  with  figure  8).  The  unfiltered  cable 
power  distribution  does  still  look  much  more  log  normal  than  chi  square,  however  (see  Figure 
12). 


After  all  the  above  procedures  had  been  carried  out,  a  tendency  (trend)  for  the  cable  drive 
amplitude,  and  resulting  cable  current  amplitude  to  grow  with  frequency  was  still  present.  This 
trend  was  eventually  traced  back  to  the  cable  drive-field  generating  equations  (16)"(18).  We 
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observed  that  if  the  drive-field  advance  equations  were  made  over  constant  frequency  increments 
(A/,  XV{L^)  constant),  this  equation  could  not  contribute  to  this  drive  amplitude  shifting: 
Previously,  we  had  always  used  an  exponentially  expanding  A/.  Making  A/ constant  implies  that 
the  relative  proportions  of  old  and  new  values,  i.e.,  the  effective  cannot  change  during  the 
generation  of  the  drive-field  ensembles. 

A  similar  procedure  cannot  reasonably  be  applied  to  the  equation  which  advances  the 
drive-field  along  the  position  axis,  as  keeping  (X/Ai)  constant  requires  using  a  different  cable 
segmenting  scheme  (different  M)  at  every  frequency. 

Use  of  the  (17)-(18)  scheme  where  cable  drive  field  is  first  advanced  in  the  space 
direction  results  in  the  contamination  of  the  subsequent  frequency-directed  drive  advancement  as 
well.  Consequently,  we  reversed  the  order  of  the  frequency  and  space  cable-drive  advancement 
so  the  frequency  advancement,  at  least,  could  be  kept  pure  of  changing effects.  Also,  we 
changed  these  equations  to  make  drive-field  advancement  occur  in  R  space  (varying  from  0  to 
1  in  (16)),  not  u  and  v  space.  In  other  words,  (16)  is  now  used  to  find  the  w  and  v  ensembles 
after  the  R  ensembles  have  been  generated.  Thus,  the  equations  for  creating  an  R  ensemble  now 
become 
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with  (16)  subsequently  applied  to  R,p^,,  to  find  the  u  and  v.  (Reversing  the  order  of  application 
of  (16)  and  (17)-(18)  should  be  purely  cosmetic,  and  not  change  any  actual  values  in  the  final 
rescaled  drive.)  Additionally,  we  now  are  very  careful  to  create  an  entire  2D  Rj^,^  intermediate 
arrays,  so  there  can  be  no  possible  ambiguity  about  the  definition  of  the  look-back  values  R,.^ 
and  Rj,^ . 

These  additional  precautions  lead  to  an  even  more  improved  (but  still  not  perfect) 
approach  of  the  u  and  v  ensembles  to  a  normal  distribution  (compare  the  probability  plots  of 
Figure  13  with  those  of  Figures  7  and  10).  Also,  the  filtered  cable  power  distribution  now 
approaches  chi  square  much  more  closely  (compare  Figure  14  with  Figures  8  and  11).  Also,  the 
unfiltered  cable  power  distribution  finally  begins  to  pull  away  from  the  ubiquitous  log  normal 
form  (compare  Figure  15  with  Figures  9  and  12). 

There  is  still  some  growth  trend  remaining  on  the  u  and  v  ensembles  as  a  function  of 
frequency,  even  after  all  this  correcting.  The  final  tuning  of  this  drive-field  generator  is  now 
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completed,  but  resulting  cable  current  quadrature  components  and  powers  were  not  available  at 
the  time  of  this  paper  deadline. 
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Figure  1.  Probability  plots  of  observed,  trend-removed  Figure  2.  The  EMPTAC  cable  power  has  a  central  peak 

EMPTAC  cable  powers  show  good  fit  vs  a  distribution.  several  steps  wide  in  its  frequency  autocorrelation. 


Figures.  Numerically  simulated  cable  power  auto-  Figure  4.  Forcing  autocorrelation  on  the  driver  ensembles 

correlation  after  forcing  autocorrelation  onto  cable  drivers.  makes  distribution  tails  non-normal  and  reduces  <5  by  ~.3. 
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Normal  distribution  assumed, 
avere  =— .5048e— 3  real  part 
sigre  =  .1051  e  —  1  (solid) 
aveim  =  ,4429e-3  imag  part 
sigim  =  ,1053e~1  (dashed) 
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Figure  7.  The  phase  quadrature  cable  currents  resulting 
from  rescaled  drive  do  not  fit  a  normal  probability  plot. 


Figure  8.  The  cable  power  distribution  does  not  fit  a 
K-S  lest  for  being  chi  square,  even  after  filtering. 


Figure  9.  The  unfiltered  cable  power  distribution  has  an 


excellent  match  to  a  log  normal  probability  plot. 


Figure  10.  Addition  of  radiation  resistance  makes  the 
phase  quadrature  cable  current  components  more  normal. 
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INVERSE  PROBABILITY  FUNCTION  INVERSE  PROBABILITY  FUNCTION 


Analysis  of  different  contributions  to  the  coupling  between 
reflector  antennas  on  a  satellite 


C.Park,  P.Ramanujam 

Hughes  Space  and  Communications  Company, 
Los  Angeles,  CA-90009. 


Owing  to  the  complexity  in  present  day  satellite  requirements  it  is  quite  common  that 
many  different  antennas  are  mounted  on  a  satellite.  The  power  radiated  from  some  of  these  can 
be  very  high  especially  for  direct  broadcast  applications.  For  these  applications  a  separate  re¬ 
ceive  antenna  is  generally  used  to  increase  the  isolation  between  the  transmit  and  receive  anten¬ 
nas.  Besides,  the  transmit  and  receive  antennas  should  be  optimally  located  in  order  to  achieve 
the  maximum  advantage  with  reference  to  isolation  between  the  two.  An  accurate  analysis  of 
the  coupling  between  the  two  antennas  can  aid  the  antenna  designer  for  optimally  locating  the 
different  antennas  on  the  spacecraft.  This  paper  discusses  the  procedure  and  the  techniques  used 
for  evaluating  an  antenna  coupling  on  a  typical  satellite. 

The  satellite  under  consideration  has  two  transmit  and  one  receive  antenna.  Due  to  the  me¬ 
chanical  considerations,  the  two  transmit  antennas  were  designed  to  be  of  dual  offset  reflector 
configuration,  while  the  receive  was  a  single  offset  reflector.  The  three  major  components  to  the 
coupling  are: 

(1)  Direct  transmit  horn  to  receive  horn  coupling:  This  corresponds  to  the  coupling  between  the 
feed  horns  illuminating  the  reflector  systems.  Since  the  transmit  antenna  is  a  dual  reflector  and 
the  receive  antenna  is  a  single  reflector,  the  two  feeds  directly  face  each  other  although  at  an 
angle.  Since  the  horns  are  sufficiently  separated,  a  simple  Friss  transmission  formula  was  used 
to  determine  this  component  to  the  coupling. 

(2^  Transmit  horn-subreflector  to  receive  feed  coupling:  Due  to  the  smaller  size  (in  wave¬ 
lengths  )  and  the  broad  radiation  of  the  subreflector,  this  contribution  was  also  considered  .  A 
simple  approach  to  determining  this  coupling  is  to  analyze  the  far-field  radiation  pattern  of  the 
subreflector  and  to  use  the  Friss  transmission  formula  to  obtain  the  coupling  onto  the  receive 
feed,  A  more  accurate  approach  is  to  analyze  the  radiation  pattern  of  the  subreflector  in  the 
near-field  using  the  geometrical  theory  of  diffraction  (GTD).  Both  these  approaches  were  used 
to  determine  this  component  of  the  coupling. 

(3)  Transmit  hom-subreflector-main  reflector  to  receive  feed  coupling:  This  corresponds  to  the 
coupling  to  the  receive  feed  through  the  aperture  fields  emanating  from  the  main  reflector  of 
the  dual  reflector.  This  in  turn  requires  the  evaluation  of  the  near-fields  over  the  plane  in  front 
of  the  reflector  and  containing  the  receive  feed.  Near-freld  Physical  optics  was  utilized  to  evalu¬ 
ate  these  nearfields  due  to  the  currents  on  the  main  reflector,  introduced  by  the  feed  and 
subreflector. 
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Thus  the  basic  techniques  used  for  analyzing  the  above  three  components  are  : 

1)  Geometrical  Optics(GO)  -  Friis  transmission  formula 

2)  Geometrical  Optics  and  Geometrical  theory  of  diffraction(GTD) 

3)  Near-field  Physical  Optics  (PO) 

The  numerical  results  indicating  the  contributions  due  to  the  different  components  dis¬ 
cussed  above,  obtained  using  the  reflector  antenna  analysis  software  GRASP,  will  be  discussed. 
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SIMPLE  RADIATION  MODELS  IN  LIEU  OF  EMC  RADIATED  EMISSIONS  TESTING 


Reinaldo  Perez 
Jet  Propulsion  Laboratory 
California  Institute  of  Technology 
Pasadena,  California 


Abstract 


In  mixed  analog/digital  printed  circuit  boards  the  electromagnetic 
emissions  originate  mainly  from  the  synchronous^  (or  clocked) 
circuits  present.  A  good  estimate  of  the  radiated  emissions  from  an 
electronic  equipment  can  often  be  obtained  by  analyzing  the 
electromagnetic  environment  produced  by  the  clocked  circuits  only. 
This  approach  avoids  the  need  for  extensive  radiated  emissions  EMC 
testing  over  a  very  large  range  of  frequencies  (10  KHz-22  GHz) .  The 
method  of  moments  (MOM)  is  used  to  calculated  the  radiated 
emissions  from  a  clock  circuit  in  a  printed  circuit  board  as  an 
example.  Simple  MOM  models  can  provide  good  estimates  of:  a)  the 
frequencies  and  levels  of  radiated  emissions  of  most  concern,  and 
b)  frequency  range (s)  where  accurate  emissions  measurements  needs 
to  be  made  for  compliance  with  EMC  regulatory  agencies. 


1.0  INTRODUCTION 


EMC  Testing  is  required  of  all  electronic  assemblies  which  are 
capable  of  editing  electromagnetic  radiation  above  certain 
frequencies  (e.g  lOKHz  for  space  systems) .  Commercial  (national  and 
international)  and  military  EMC  regulatory  agencies  have 
established  mandatory  requirements  (or  standards)  of  EMC  compliance 
for  all  electronic  products  manufactured  for  procurement  purposes. 
Radiated  Emissions  testing  is  one,  of  a  series  of  tests,  required 
for  EMC  compliance.  The  objective  of  the  test  is  to  assure  that  the 
electromagnetic  environment  emitted  by  newly  designed  equipment  is 
not  above  mandatory  radiated  emissions  limits  (in  db/iV/m)  for  a 
given  frequency  range.  Figure  1,  for  example,  shows  the  maximum 
allowed  radiated  electric  fields  for  aircraft  and  space  systems  in 
the  required  10  KHz-20  GHz  frequency  range  at  1.0  meter  from  the 
source.  When  a  radiated  emission  test  is  performed  it  is  required 
for  measurements  to  be  made  over  the  whole  frequency  range.  This  is 
necessary  because  test  engineers  do  not  know  a  priori  what 
electromagnetic  emissions  (frequencies  and  levels)  will  be  emitted 
from  the  designed  hardware.  It  is  not  uncommon  to  take  measurements 
over  a  large  frequency  range  in  order  to  discover  that  the  only 
emissions  present  are  those  within  a  very  limited  frequency  band. 
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A  considerable  amount  of  effort  and  time  is  often  spent  performing 
EMC  emission  tests.  EMC  testing  has  implications  in  the  overall 
cost  of  hardware  design  and  its  time-to— market  schedule.  It  would 
be  of  great  advantage  for  test  engineers  to  develop  analysis 
methods  that  will  allow  them,  not  only  to  identify  the  emission 
spectrum  of  an  electronic  assembly  before  testing,  but  also  to 
calculate,  with  certain  accuracy,  the  emission  levels  corresponding 
to  such  spectrum.  It  is  possible  to  justify  that  if  the  emission 
levels  calculated  are  significantly  below  the  allowed  emission 
limits  of  a  given  standard,  EMC  emissions  testing  could  be  deemed 
unnecessary. 


2 . 0  ANALYSIS  OF  RADIATED  EMISSIONS  FROM  A  MIXED  ANALOG/DIGITAL  PCB 


An  analysis  technique,  based  on  the  method  of  moments  (MOM) ,  is 
discussed  to  analyze  the  emissions  from  mixed  an  analog/digital 
Printed  Circuit  Board  (PCB) .  Mixed  analog/digital  PCB  have  become 
very  popular  due  to  the  increasing  need  of  many  electronic  products 
to  process  acquired  analog  data  in  a  digital  fashion.  Presently, 
most  circuit  analysis  tools  (SPICE-like  tools)  have  incorporated 
mixed  analog/digital  simulation  in  their  capabilities  to  respond  to 
the  growing  need  for  such  circuits.  Furthermore,  electronic  design 
automation  (EDA)  software  tools,  such  as  those  used  in  ASIC  and  MCM 
design,  have  also  implemented  mixed  analog/digital  analysis 
algorithms  in  their  software  structures. 

Analysis  of  the  electromagnetic  emissions  from  mixed  analog/digital 
boards  is  more  simple  since  the  electromagnetic  environment 
generated  from  such  boards  is  due  primarily  to  the  synchronous 
digital  circuits,  of  which  the  clocked  circuits  are  the  main 
culprits.  Often  in  mixed  analog/digital  boards  the  digital  portion 
occupy  less  than  50%  of  the  circuitry.  The  other  major  source  of 
radiated  emissions  from  digital  electronics  is  that  due  to  common 
mode  currents  [1]  as  manifested  in  the  radiated^  fields  from 
interface  and  power  cables.  Such  emissions,  which  in  many  cases 
have  the  largest  magnitude  in  the  radiated  spectrum,  will  not  be 
considered  here  as  will  be  explained  later. 

We  now  consider  the  electronic  assembly  described  in  Figure  2 .  It 
shows  the  Solar  Array  Deploy  Electronics  (SADE)  which  together  with 
the  Solar  Array  Deploy  (SAD)  mechanism,  the  Array  Control 
Electronics  (ACE) ,  and  the  Inertial  Mechanism  Interface  Unit  (IMIU) 
constitute  the  functional  hardware  for  deploying  solar  arrays  in  an 
spacecraft  and  satellites.  The  SADE  is  the  assei^ly  of  interest  in 
this  analysis.  A  top  view  (cover  removed)  and  side  view  (connector 
inputs)  of  the  SADE  is  shown  in  Figure  3.  The  figure  shows  the 
"analog”  and  "analog/digital"  parts  of  the  only  PCB  in  the  SADE. 
Figure  4  shows  a  partial  schematic  of  the  mixed  "analog/digital" 
part  of  the  PCB  in  the  SADE  of  Figure  3.  The  schematic  shown 
contains  the  digital  circuits  which  are  part  of  the  SADE  board  and 
only  a  portion  of  the  analog  circuits.  The  digital  circuits 
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involved  are  an  A/D  converter  and  four  Flip/Flops  as  shown  in  the 
figure.  The  SADE  PCB  is  a  multilayer  board  (5  layers) .  The  board 
contains  one  power  plane,  one  ground  plane  and  three  signal  planes. 
Layer  5  (a  signal  plane)  is  where  the  clock  traces  of  the  digital 
circuits  are  located.  Figure  5  shows  the  signal  and  clock  traces  of 
the  circuits  on  layer  5.  The  clock  frequency  is  3  MHz. 

In  this  analysis  the  radiated  emissions  caused  by  the  clock  are  of 
greatest  importance.  Figure  6  shows  only  the  clock  traces  on  layer 
5  and  it  also  illustrates  some  of  the  aspects  in  the  method  of 
moments  modeling. The  method  of  moments  code  automatically  creates 
an  image  of  the  clock  trace  due  to  the  presence  of  the  ground 
plane.  The  clock  pulse  is  of  trapezoidal  shape  (50%  duty  cycle, 
only  odd  harmonics  are  present)  with  5V  magnitude  and  rise 
time/ fall  times  of  30nS.  The  spectrum  for  the  first  several 
harmonics  is  shown  in  Table  1.  The  magnitude  spectrum  of  the  clock 
pulse  (i.e  frequency  .vs.  voltage  level)  is  calculated  as  given  by 
equation  1  [2]. 


v{n) 


=  2*D*V*\ 


Sin  {n*n*D) 
{n*n*D) 


Sin  (n*n*t-/T) 

+  _ = - 

(n*n^t^/T) 


(1) 


where  V  is  the  peak  amplitude  in  volts  (5V) ,  D  is  the  duty  cycle 
(pulse  width/T)  ,  t^.  is  the  rise  time  (seconds)  ,  T  is  the  period 
(units  per  second) ,  and  n  is  the  harmonic  number. 

The  modeled  voltage  source  in  Figure  6  is  frequency  dependent  as 
shown  by  equation  1.  The  magnitude  voltages  are  given  for  each 
frequency  based  on  the  calculated  spectrum  (all  harmonics)  of  the 
clock  pulse.  The  clock  traces  are  modeled  by  equivalent  round  wires 
as  shown  by  equation  2  [ 3 ] . 

R  =  0,335*W*{0.8  +  t/m  (2) 

for  0.1  <  t/W  <  0.8,  where  W  is  the  width  of  the  microstrip  in 
meters,  t  is  the  thickness  of  the  microstrip  in  meters  and  R  is  the 
wire  radius  in  meter.  The  wire  diameter  is  calculated  to  be  0.82mm 
diameter.  Though  not  necessary,  in  the  MOM  code  we  assumed  perfect 
conductivity  for  the  microstrip  traces  in  the  PCB.  Since  the 
fundamental  frequency  is  only  3  MHz,  it  is  assumed  that  at  such  low 
frequencies  (including  harmonics)  the  dielectric  losses  in  the  PCB 
are  minimum  [4],  hence,  dielectric  effects  are  not  included.  The 
total  number  of  segments  used  in  the  MOM  modeling  is  24.  Though 
chip  loading  could  have  been  included  [2]  it  is  neglected  in  this 
work  for  simplicity.  To  assess  the  validity  of  this  analytical 
approach,  the  fields  emitted  by  the  SADE  were  also  measured  in  an 
anechoic  room. 
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3.0  MEASURED  AND  CALCULATED  RESULTS 


The  measured  results  were  obtained  as  described  in  Figure  7.  In  the 
measurement  procedure  the  top  cover  of  the  SADE  was  removed  (see 
Figure  3)  and  the  motor  drive  board  (see  Figure  2)  was  disabled. 
Table  2  shows  the  calculated  and  measured  radiated  emissions  at  1.0 
meter  distance  (as  required  per  Figure  1)  from  the  source  (PCB)  for 
several  harmonics  of  the  fundamental  clock  frequency.  The  results 
show  that  a  better  accuracy  between  the  calculated  and  measured 
data  can  be  obtained  at  higher  harmonics  of  the  fundamental  clock 
frequency. 

Finally,  we  are  ready  to  assess  analytically  the  electromagnetic 
emissions  from  the  completely  assembled  SADE  (with  cover)  which  is 
its  noraal  configuration.  EMC  emissions  testing  requires  the  normal 
configuration.  The  approximate  shielding  effectiveness  (SE)  of  the 
SADE  enclosure  (thickness  of  0.15  inch  aluminum  with  10  venting 
round  holes,  0.25  inch  diameter)  is  calculated  per  a  procedure 
described  in  [5].  The  venting  holes  must  be  considered  in 
calculating  SE  since  SE  depends  on  the  diameter  of  holes,  number  of 
holes  and  their  arrangement.  For  example,  SE  decreases  as  frequency 
increases  and  hole  diameter  increases. 

Table  2  also  shows  the  external  radiated  fields  after  shielding 
effectiveness  is  factored  in.  The  results  show  that  the  external 
emissions  are  very  much  well  below  the  limits  of  Figure  1.  For 
example,  at  33  MHz  the  emissions  after  SE  is  -31.2  dBjLiV/m  at  1.0 
meter  (0.027  /iV/m)  which  is  somewhat  below  the  noise  floor  level  of 
most  spectrum  analyzer.  Though  emissions  can  also  come  from  the 
cables  connected  to  the  SADE  (due  to  the  common  mode  current)  , 
flight  cables  in  spacecraft  are  normally  completely  shielded 
(including  connectors)  and  well  grounded.  Several  layers  of  shields 
are  used  on  the  cables.  The  emissions  from  such  cables  is  minimum. 
Furthermore,  though  some  emissions  could  have  been  expected  from 
the  motor  drive  board  they  are  in  the  low  KHz  range  (<  30KHz  for 
this  board)  and  mostly  confined  within  the  SADE  enclosure.  It  can 
then  be  concluded  from  this  analysis  that  there  is  no  need  to 
conduct  EMC  emission  testing  for  this  particular  assembly. 

In  many  mixed  analog/digital  PCB  it  is  often  the  case  than  more 
than  one  clock  frequency  is  present  for  different  synchronous 
circuits  within  the  same  PCB.  Though  it  could  be  said 
simplistically  that  the  above  analytical  procedure  can  be  repeated 
for  each  of  the  different  circuits  involved  and  then  superimpose 
the  results,  the  correct  approach  is  much  more  complicated.  In  the 
near  field  the  coupling  of  electromagnetic  fields  to  other 
radiating  and  non-radiating  wire  structures  (e.g  traces,  connector 
pins,  chip*s  connectors)  within  the  PCB  must  also  be  considered  due 
to  the  induced  current.  Additional  work  needs  to  be  done  in  this 
area. 


326 


4,0  CONCLUSION 


EMC  testing  is  usually  the  last  "hurdle"  for  qualification  of  an 
electronic  product  to  market.  Full  scale  EMC  tests  are  long  and 
costly.  Considerable  savings  could  be  achieved  if  EMC  testing  could 
be  reduced  in  scope,  only  performing  the  testing  that  is  needed. 
Computational  electromagnetics,  and  in  this  particular  case  the 
method  of  moments,  can  be  useful  in  assessing  the  need  for  radiated 
and  conducted  emission  EMC  testing  of  a  prototype  assembly. 
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Harmonic 

Frequency 

Harmonic 
Amplitude 
in  (dB) 

Harmonic 

Amplitude 

in 

(volts) 

1 

3  MHz 

-4.038 

3.14 

2 

6  MHz 

-151.578 

o 

o 

i 

3 

9  MHz 

-14.533 

0.938 

4 

12  MHz 

-153.048 

-0.0 

5 

15  MHz 

-21.017 

0.4447 

6 

18  MHz 

-176.595 

-0.0 

7 

21  MHz 

-27.500 

0.2108 

8 

24  MHz 

-160.465 

-0.0 

9 

27  MHZ 

-36.124 

0.078 

10 

30  KHz 

-166.558 

o 

o 

i 

11 

33  MHz 

-64.66 

0.003 

Table  1.  Spectrum  of  the  3  MHz  clock  in  the 
PCB  for  several  harmonics. 


Frequency 

E(dBAtV/m) 
Calculated 
at  1.0  m 

E(dBjUV/m) 
Measured 
at  1.0  m 

A(dB/iV/m) 

E(dB/iV/m) 
Calculated 
after  SE 

3  MHz 

17.4 

5.2 

12.2 

-85.6 

9  MHz 

22.5 

10.8 

11.7 

-81.2 

15  MHz 

34.6 

23.7 

10.9 

-74.6 

21  MHz  ' 

40.15 

31.5 

8.60 

-66.1 

27  MHz 

49.74 

41.9 

7.84 

-54.8 

33  MHz 

67.84 

60.1 

7.74 

-31.2 

Table  2 •  Comparison  of  Calculated  and  Measured  Electric  fields  for 
Clock  Circuit  and  the  Effect  of  Shielding  Effectiveness 


Limit  Levei  (dBuV/m) 
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Figure  5.0  Signal  and  Clock  Traces  on  Layer  5  of  PCB 


Figure  6.  MOM  Modeling  of  Clock  Traces  in  PCB 


332 


SESSIONS: 

OPTIMIZATION  TECHNIQUES  IN 
APPLIED  ELECTROMAGNETICS 


Chair:  O.A.  Mohammed 


333 


AN  OPTIMIZATION  APPROACH  TO  REDUCE  THE  DISCRETIZATION  ERROR 
IN  FINITE  ELEMENT  EXPLICIT  SOLUTION  SCHEME 
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Via  Eudossiana  18,  00184  Rome,  Italy 

Abstract  -  Two  explicit  solution  schemes  of  Maxwell’s  curl  equations  based  on  the  leap-frog 
algorithm  are  presented.  The  first  method  is  based  on  a  finite  element  discretization  of  space  and  a 
finite  difference  integration  of  time  and  leads  to  an  explicit  solution  scheme  by  applying  the  mass 
lumping  algorithm.  The  results  produced  by  this  method  can  be  inaccurate  when  nonuniform  meshes 
are  adopted  since  the  mass  lumping  algorithm  depends  strongly  on  the  shape  of  the  elements.  The 
second  explicit  solution  scheme  is  based  on  a  mixed  finite  element-finite  difference  discretization  of 
space  and  a  finite  difference  integration  of  time.  In  order  to  avoid  numerical  instabilities  and  to  improve 
the  accuracy,  an  optimization  problem  is  defined  to  find  a  mesh  that  minimizes  the  discretization  error. 
In  the  paper  the  salient  features  of  a  simulated  annealing  (SA)  algorithm  are  described  to  generate  an 
optimized  two  dimensional  triangular  mesh. 

I.  INTRODUCTION 

New  numerical  methods  based  on  vector  finite  elements,  edge  elements  and  Whitney  elements  have 
been  recently  developed  to  solve  Maxwell's  curl  equations  in  time  domain  adopting  the  leap-frog 
scheme  [l]-[6].  These  methods,  named  as  finite  element-time  domain  (FE-TD)  methods,  are  very 
powerful  since  explicit  solution  algorithms  reduce  computation  time  dramatically  and  finite  elements 
are  very  convenient  to  model  complex  geometrical  configurations. 

In  this  paper,  a  two-dimensional  problem  is  analyzed  by  two  different  procedures.  In  the  first 
method  the  finite  element  interpolation  is  used  to  solve  the  fields  spatially:  the  vector  magnetic  field  is 
approximated  by  edge  elements  while  the  scalar  electric  field  is  assumed  constant  in  each  element  [4]- 
[5].  The  resulting  system  of  equations,  obtained  by  the  Galerkin  formulation,  is  nondiagonal.  The  mass 
lumping  algorithm  is  applied  to  diagonalize  the  system  and  an  explicit  solution  scheme  is  obtained.  The 
accuracy  of  the  method  is  strongly  dependent  on  the  shape  of  the  triangular  elements:  the  method 
performs  well  when  grids  composed  by  equilateral  triangular  elements  are  used  since  the  dispersion 
relation  is  second  order  accurate  [5],  but  the  accuracy  decreases  for  nonregular  meshes  since  right  and 
nonacute  triangles  lead  to  zero  or  negative  masses  which  produce  numerical  instabilities.  In  this  cases 
the  second  method  is  applied:  the  diagonalization  of  the  mass  matrix  is  avoided  by  adopting  a  mixed 
finite  element-finite  difference  discretization  of  space  [1].  This  approach  is  less  sensitive  to  the  element 
shape  and  does  not  present  stability  problems. 

In  practical  applications  it  is  not  possible  to  use  equilateral  triangular  meshes  to  discretize  complex 
configurations.  W^en  a  nonregular  mesh  composed  by  elements  of  different  shape  and  size  is  adopted, 
the  main  problem  of  the  numerical  procedures  consists  in  maintaining  good  accuracy  in  the  field 
solution.  Therefore  global  optimization  techniques  are  used  in  order  to  find  a  good  discretization  of  the 
domain  by  triangular  elements  which  ensues  good  numerical  accuracy  and  stability.  The  optimization 
strategy  consists  in  minimizing  by  a  simulated  annealing  (SA)  procedure  [7]- [8]  the  error  due  to  the 
space  discretization  modifying  the  position  of  the  mesh  nodes  and  element  shapes  continuously.  An 
illustrative  example  is  proposed  to  test  the  validity  of  the  two  different  procedures. 
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n.  Mathematical  FORMULATION 

Let  us  consider  the  following  form  of  Maxwell's  curl  equation  in  time  domain: 


VxH  -  e- 


VxE  +  [i—  =0  (lb) 

a 

where  E  and  H  are  the  electric  and  magnetic  fields,  and  \i  and  s  are  the  permeability  and  permittivity. 
In  order  to  simplify  the  mathematical  formulation,  the  study  is  limited  to  a  two  dimensional  problem, 
i.e.  Ex=Ey=Hz=0. 

n.l  Mass  Lumping  Method 

Discretizing  the  region  in  triangular  elements,  the  magnetic  field  in  each  element  is  approximated  by 
edge  elements  as  [2]; 

H(r,t)s  E  (2) 

k=l 

where  r  is  the  position  vector,  the  length  of  the  k-th  edge,  H^k  the  magnetic  field  component 
along  the  k-th  edge,  Wk  the  Whitney  1-form  vector  shape  function  associated  to  the  k-th  edge  [4]. 

The  electric  field  E  is  assumed  to  be  piecewise  constant: 

E(r,t)-^c(r)E^(t)z  (3) 

where  Ez®  is  the  constant  electric  field  inside  the  considered  element  and  Xq  the  trial  function  associated 
with  the  node  located  in  the  element  barycenter  satisfying  the  condition:  for 

for  The  adopted  field  approximations  are  shown  in  Fig.l. 

Applying  Galerkin  method  to  (1)  using  the  field  interpolations  (2)  and  (3),  the  following  equations 
are  obtained  [4] : 


3 


E®  J’^eXc(Vx  Wh).zdn  +  n®  E  — J^eWk-Whdf2  =  ( 


where  wj,  is  the  weighting  function  with  h=l,2,3,  and  ft®  and  s®  are  the  local  permeability  and 
permittivity,  respectively. 

The  systems  of  equations  (4)  expressed  in  matrix  form  are: 


dE 

[K®]t[H^]-^  =0 
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[f]  E®+[T®]  ^^^  =  0  (5b) 

^  dt 

where  the  coefficients  of  matrix  [T^]  and  vectors  [K^],  [J®]  are  given  by: 

(6a) 

A  £ 

J"k=L(Vxwk)-2dn  (6b) 

w^-w^dQ  (6c) 

where  is  the  element  area. 

Assembling  the  local  systems  (5),  the  following  global  systems  are  obtained: 

[KyH,]-^  =  0  (7a) 

[J][E.]  +  [T]M  =  0  (7b) 


The  components  of  vectors  [Ez]  and  [Hx]  are  equal  to  the  number  of  elements  and  edges, 
respectively. 

Time  domain  is  discretized  by  applying  the  leap-frog  technique:  the  electric  field  is  calculated  at  the 
time  t"=nAt,  while  the  magnetic  field  is  calculated  at  t^+^^2=(n+l/2)At,  where  n  is  the  number  of 
iteration  and  At  the  time  step.  Substituting  time  derivatives  in  (7)  by  central  finite  differences,  the 
following  final  systems  of  equations  are  obtained: 


(a)  (b) 

Fig.  1.  Vector  magnetic  field  (a)  and  scalar  electric  field  (b)  approximations  in  a  triangular  finite  element. 
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The  electric  field  vector  [E^]  is  directly  calculated  by  (8a):  at  each  time  step,  time  derivative  of  the 
electric  field  E^p  in  the  p-th  element  depends  on  the  three  components  of  the  magnetic  field  tangential 
to  the  edges  of  the  considered  element,  as  shown  in  Fig. 2a.  The  calculation  of  the  tangential 
components  of  the  magnetic  field  [H^]  is  performed  by  (8b):  time  derivative  of  the  tangential  magnetic 
field  component  depends  not  only  on  the  electric  field  in  the  p-th  and  q-th  elements  which  share 
the  k-th  edge,  but  also  on  the  time  derivative  of  the  magnetic  field  tangential  to  the  other  edges 
constituting  the  p-th  and  q-th  triangles,  as  shown  in  Fig. 2b.  Since  the  mass  matrix  [T]  is  consistent  or 
nondiagonal,  the  system  of  equations  (8b)  must  be  solved  at  each  time  step.  In  order  to  obtain  an 
explicit  solution  scheme,  the  mass  matrix  [T]  is  diagonalized  by  applying  an  opportune  quadrature 
scheme,  the  mass-lumping  algorithm,  to  calculate  the  integral  in  (6c).  Using  the  tangential  components 
of  the  magnetic  field  at  mid-points  of  the  edges  as  quadrature  points,  the  integral  in  (6c)  can  be 
approximated  as  follows  [4]-[5]: 


j^c Wh-Wkdf2=  SiWh(rj,k)-Xj,k  wk(rj,k)-Xj,k  + 


Sj Wh(ri,k)-Xi,k  Wk(ri,k)-Xi,k  +Sk Wh(ri,j)’Xi,j  Wk(ri,j) *X ij  (9) 

where  x  ij  is  the  unit  vector  tangential  to  the  edge  between  i-th  and  j-th  nodes,  ry  =  (ri+rj)/2  is  the 
middle  point  of  the  edge  between  i-th  and  j-th  nodes,  and  Si  is  the  weight  given  by  the  following 
expression: 


g.  ^  (10) 

*  2 

where  is  the  length  of  the  i-th  edge,  hj  is  the  height  of  the  triangle  for  the  i-th  node  and  dj  the 
oriented  distance  between  the  ortocenter  O  of  the  element  and  the  i-th  edge  according  to  the  notation 
shown  in  Fig.3.  Sj  and  Sk  are  given  by  expressions  analogous  to  (10). 

Using  this  quadrature  scheme  the  method  is  efficient  when  the  mesh  is  composed  by  acute  triangles. 
When  the  ortocenter  of  the  element  coincides  with  one  vertex  of  the  element  (i.e.  right  triangle)  or 
falls  outside  the  element  (i.e.  nonacute  triangle),  the  mass  lumping  algorithm  leads  to  zero  or  negative 
masses  respectively,  and  the  method  becomes  unstable. 


Fig,  2.  Explicit  calculation  of  the  scalar  electric  field  £3  (a)  and  implicit  calculation 
of  the  magnetic  field  tangential  component  Rj-  (b). 
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Fig.  3.  Notation  adopted  in  the  calculation  of  the  weight  Sj  in  the  mass  lumping  algorithm 


n.2  Mixed  Finite  Element-Finite  Difference  Method 

An  explicit  solution  scheme  can  be  obtained  directly  by  adopting  a  mixed  finite  element-finite 
difference  discretization  of  space  [1].  Applying  the  finite  element  field  approximation  (2)  and  Galerkin 
method,  equation  (8a)  is  still  derived.  The  electric  field  is  assumed  to  be  piecewise  constant  as  in  the 
first  method  and  the  tangential  component  of  the  magnetic  field  is  obtained  by  a  finite  difference 
approximation.  The  k-th  component  is  then  given  by: 


TTn+1/2  ^n-Ml 


At 


2 

|iq  +  \Xp 


n 


(11) 


where  dqp  is  the  distance  calculated  normally  to  the  k-th  edge  between  the  two  electric  field  nodes 
which  share  the  edge,  as  shown  in  Fig. 4.  In  this  way  the  tangential  component  depends  only  on 
the  electric  field  in  the  p-th  and  q-th  elements. 

Assembling  (11)  the  following  explicit  system  is  obtained; 


At 


-  [M]  [Ez]" 


(12) 


where  [M]  is  a  finite  difference  matrix  with  only  two  nonzero  coefficients. 

The  method  based  on  explicit  solution  of  (8a)  and  (12)  is  stable  but  the  accuracy  depends  clearly  on 
the  shape  of  the  elements. 

It  should  be  noted  that  this  approach  leads  to  the  same  solution  scheme  obtained  by  using  Whitney 
1-  and  Withney  2-  forms  in  a  two-dimensional  configuration  [6]. 


Fig.4.  Explicit  calculation  of  the  magnetic  field  tangential  component 
in  the  mixed  finite  element-finite  difference  method. 
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III.  Mesh  Optimization  by  Simulated  Annealing 


When  it  is  not  possible  to  use  regular  meshes,  the  discretization  error  can  lead  to  insufficient 
accuracy  in  the  field  solution.  In  this  case  the  main  problem  is  to  find  a  mesh  that  minimizes  the 
discretization  error.  The  problem  of  finding  a  good  discretization  of  the  domain  can  be  formulated  as 
an  optimization  problem:  for  a  given  configuration  the  aim  of  the  optimization  procedure  is  to  look  for 
the  position  of  the  nodes  such  that  the  error  produced  by  the  discretization  is  minimum.  The  element 
error  can  be  defined  as: 

Ee=l-Qe  (13) 

where  is  the  quality  factor  of  a  triangle  given  by  [10]: 

Qe  =  8  (Le  -  (Le  -  (L^  -  42)  /  (14) 

where  L®  is  the  triangle  semi-perimeter  L®  =  +  4^y2. 

The  objective  function  which  must  be  minimized  is  defined  as  the  sum  of  the  errors  associated  to 
each  element.  A  constraint  is  also  imposed  to  respect  the  stability  condition  of  the  FE-TD  methods 
which  states  that  the  time  step  At<  hjyun/c,  where  hjjjiji  is  the  minimum  length  of  the  edges  and  c  the 
propagation  velocity. 

Let  X  be  a  vector  of  2N  components,  given  by  the  coordinates  of  the  N  nodes  internal  to  the 
domain,  andy(x)  the  function  to  be  minimized,  we  state  the  optimization  problem  as  follows: 

find  the  vector  x  such  that: 
f(x)  min 

with  each  of  the  2N  variables  ranging  in  a  finite,  continuous  interval. 

Simulated  annealing  (SA)  algorithm  in  the  context  of  generating  a  two-dimensional  mesh  for  finite 
element  computation  is  analyzed.  The  simulated  annealing  is  a  well  known  powerful  global 
optimization  algorithm,  used  in  combinatorial  optimizations  [7]-[8].  It  is  based  on  random  moves,  and 
avoids  entrapment  in  local  minima  by  allowing  uphill  moves,  controlled  by  means  of  an  acceptance 
criterion.  The  SA  here  used  is  derived  from  the  simulated  annealing  algorithm  for  functions  of 
continuous  variables  proposed  in  [9]. 

The  algorithm  proceeds  iteratively:  starting  from  the  point  Xq  it  generates  a  sequence  of  points  x^, 
Xj,  xj  ,..  until  it  reaches  the  global  minimum  of  the  objective  function.  At  the  m~th  step  of  the 
simulated  annealing  algorithm,  a  new  configuration  x^i+l  points  is  generated  by  random 
perturbation  of  the  previous  state  x^,.  The  random  perturbation  is  obtained  by  changing  the 
coordinates  of  the  nodes  in  turn. 

If  the  value  of  the  function  X^m+l)  ^^ss  than  ^Xm),  A/  <  0  and  Xjjj+j  is  always  accepted, 
however,  if  A/"  >  0  (uphill  moves),  is  accepted  with  a  Boltzmann  distribution  probability; 

P(A/)=e*^^^  (15) 

where  T  is  a  control  parameter  called  temperature.  The  method  starts  with  a  high  value  of  T  and 
during  the  minimization  process  it  is  gradually  lowered  in  order  to  focus  to  the  minimum.  An 
established  number  Mj^^x  configurations  is  generated  at  each  temperature  value  until  the 
equilibrium  is  reached.  The  probability  that  an  uphill  move  will  be  accepted  diminishes  as  the 
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temperature  declines  until  no  function  cost  increasing  are  accepted  anymore. 

The  algorithm  has  two  termination  criteria  to  decide  if  the  global  minimum  has  been  reached.  When 
at  the  temperature  value  Ty  the  configurations  have  been  proved,  let  fy  to  be  the  current  value 
of  the  search  is  stopped  if 

|/v-/v-»|^P.  u  =  l,.,4 

and: 

/v-/(*opt)^P 

where  p  is  a  small  quantity  which  has  to  be  opportunely  chosen  and  /(xopt)  is  the  optimum  current 
value  of  the  objective  function. 

IV.  APPLICATION 


The  two  methods  are  applied  to  solve  a  very  simple  problem  as  the  plane  wave  propagation  in  a 
homogeneous  two-dimensional  domain.  First,  a  coarse  mesh  is  generated  as  shown  in  Fig. 5a.  The 
method  based  on  the  mass-lumping  algorithm  is  unstable  since  the  mesh  presents  many  nonacute 
triangles  while  the  mixed  finite  element-finite  difference  method  is  stable  but  does  not  produce  accurate 
results. 

Second,  the  proposed  SA  procedure  is  applied  to  build  an  optimized  mesh  as  shown  in  Fig.  5b.  The 
first  method  is  still  unstable  while  the  finite  element-finite  difference  method  is  stable  and  the  results 
(non  reported  in  this  paper  for  brevity)  are  accurate. 


Fig.  5  Nonoptimized  mesh  (a)  and  optimized  mesh  (b). 
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The  method  based  on  the  mass  lumping  algorithm  produces  accurate  results  only  when  the  mesh  is 
composed  by  equilateral  triangles.  In  this  case  the  numerical  expressions  of  the  two  methods  coincides. 
It  should  be  pointed  out  that  the  optimized  mesh  (see  Fig.  5b)  is  very  similar  to  a  mesh  obtained  by  the 
application  of  Voronoi  diagrams  [11],  This  result  is  relevant  because  it  permits  to  generate  a  priori 
mesh  based  on  Voronoi  polygons  without  using  any  optimization  technique.  It  yields  a  big  reduction  of 
computational  cost  in  the  preprocessing  and  accuracy  in  the  field  solution  results  when  the  mixed  finite 
elements-fmite  difference  method  is  utilized.  If  an  automatic  mesh  generator  based  on  Voronoi 
polygons  is  used,  care  should  be  taken  to  respect  the  stability  criterion  of  the  FE-TD  methods. 

V.  CONCLUSIONS 

Two  explicit  methods  based  on  the  leap-frog  scheme  have  been  presented  for  the  solution  of 
Maxwell's  curl  equations  The  first  method,  based  on  a  finite  element  discretization  of  space  and  a  finite 
difference  integration  of  time,  leads  to  an  explicit  solution  scheme  by  applying  the  mass  lumping 
algorithm.  The  accuracy  is  strongly  dependent  on  the  shape  of  the  elements:  the  mesh  must  be 
constituted  by  nonacute  triangles  to  ensure  the  stability  of  the  method. 

The  second  method,  based  on  a  mixed  finite  element-finite  difference  discretization  of  space  and  a 
finite  difference  integration  of  time,  produces  a  solution  which  is  less  sensitive  to  the  element  shape  and 
does  not  have  stability  problems. 

An  optimization  problem  has  been  defined  to  find  a  FE  mesh  which  produces  accurate  results 
avoiding  as  far  as  possible  nonacute  triangles  in  a  two-dimensional  mesh  and  proximating  all  the 
triangles  to  equilateral  triangles.  This  mesh  is  very  similar  to  a  mesh  obtained  by  the  use  of  Voronoi 
polygons,  when  the  FE-TD  stability  criterion  is  respected. 
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abstract  -  Analysis  and  design  of  a  reentrant  resonant  cavity  applicator  are  investigated.  The  electricconstants 
have  been  measured  by  the  infinitesimal  coaxial  probe  method.  Comparisons  of  calculated  and  measured  temperature 
distributions  for  a  disk-shaped  phantom  show  that  it  is  necessary  to  consider  the  temperature  dependences  of 
relative  permittivity  e^and  electric  conductivity  cr  of  the  phantom.  By  attaching  cylindrical  conductors  to  a  reentrant 
resonant  cavity  applicator,  electromagnetic  energy  is  directed  toward  a  specific  part  of  a  long  phantom  and  temperature 
rise  in  a  deep  region  is  achieved. 


I  INTRODUCTION 

It  is  important  to  obtain  deep  and  concentrated  heat  in  a  hyperthermia  system.  For  a  radio 
frequency  hyperthermia  system,  the  annular  phased  array  [1]  or  its  improved  method  [2]  have  been 
developed.  However,  it  has  been  pointed  out  that  there  are  still  some  difficulties  to  heat  deep-seated 

cancer  tumors  selectively  [3].  u  i  ,  •  rci 

Recently,  a  reentrant  resonant  cavity  applicator  has  been  utilized  [4].  Both  calculations  [5] 
and  measurements  [6]  have  been  earned  out,  however,  only  axisymmetrical  models  have  been  examined, 
which  is  not  enough  for  a  practical  application.  Moreover,  it  has  not  been  possible  to  understand  the 
phenomena  because  the  frequency  and  temperature  dependences  of  the  electric  and  thermal  constants 

for  the  phantom  have  been  neglected. 

In  this  paper,  analysis  and  design  of  a  reentrant  resonant  cavity  applicator  are  investigated.  In 
experiments,  dielectric  phantoms  (eg.  TX-150)  are  commonly  used,  therefore,  it  is  important  to  measure 

the  frequency  and  temperature  dependences  of  relative  permittivity  electric  conductivity  a,  thermal 
conductivity  X,  specific  heat  capacity  c  and  the  volume  density  of  mass  p.  By  using  the  infinitesimal 
coaxial  probe  method  [7],  it  is  confirmed  that  becomes  smaller  and  <5  becomes  larger  as  the 
temperature  rises,  which  means  considering  the  characteristics  of  £j.  andG  vs,  temperature  is  indispensable. 

We  have  also  confirmed  by  calculations  that  the  temperature  dependences  of  X,  c  and  p  can  be 
neglected.  The  frequency  dependences  (50-100  MHz)  of  these  parameters  are  found  to  be  not  vital  for 
our  system. 

Next,  the  calculated  and  measured  temperature  distributions  of  a  disk- shaped  phantom  (240 
mm  in  diameter  and  250  mm  in  thickness)  are  compared.  The  3-D  finite  difference  -  time  domain 
method  [3]  and  finite  element  method  are  used  to  obtain  electromagnetic  field  and  temperature 
distributions,  respectively,  where  the  iterative  calculations  are  carried  out  considering  the  temperature 
dependences  of  8,  and  o.  Close  agreement  between  calculated  and  measured  temperature  distributions 
in  the  phantom  show  the  validity  of  the  calculations. 

In  order  to  optimize  the  cavity  structure,  the  temperature  distributions  of  a  long  cylindrical 
phantom  (1500  mm  in  length  and  200  mm  in  diameter)  are  calculated.  Cylindrical  conductors  are  used 
[8]  to  direct  the  electromagnetic  energy  toward  a  specific  area.  Calculated  results  show  that  directed 
heat  in  a  deep  region  can  be  obtained,  which  leads  to  the  possibility  of  the  reentrant  cavity  applicator 
with  cylindrical  conductors  being  used  for  a  deep-seated  cancer  tumors. 
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II  Reentrant  Resonant  Cavity  applicator  [4] 

Figure  1  shows  the  schematic  structure  of  the  reentrant  resonant  cavity  applicator.  In  order  to 
direct  the  electromagnetic  energy  toward  a  specific  area  of  the  body,  this  system  has  been  invented. 
Inside  the  applicator,  there  is  enough  area  for  one  adult.  Precise  dimensions  are  shown  in  section  IV. 
The  block  diagram  of  the  hyperthermia  system  is  shown  in  the  same  figure.  First,  resonant  frequencies 
of  the  cavity  are  measured.  When  the  electromagnetic  energy  is  applied  through  one-turn  coil  coupled 
with  the  variable  capacitor,  a  resonant  state  is  kept  automatically  using  an  impedance  controller.  In 
our  experiment,  the  lowest  resonant  frequency  around  50-70  MHz  is  used.  Throughout  this  paper,  the 
electromagnetic  energy  of  300  Watts  has  been  applied  for  30  minute  increments.  Next,  the  phantom  is 
cut  and  the  temperature  distribution  is  measured  by  the  thermogram,  however  it  should  be  noted  that 
the  exact  input  power  is  unclear  because  some  energy  is  consumed  by  the  coil  and  capacitor.  Therefore, 
we  evaluate  the  temperature  rise  as  arbitrary  in  our  calculations. 


Fig.  1  Schematic  structure  and  block  diagram  of  a  reentrant  resonant  cavity  applicator. 


III  MEASUREMENTS  OF  ELECTRIC  AND  THERMAL  CONSTANTS 

In  the  experiment,  dielectric  phantom  (eg.  TX-150)  is  commonly  used.  The  dependences  of 
relative  permittivity  e,  and  electric  conductivity  a  on  temperature  and  frequency  have  been  pointed  out 
[9],  however,  in  past  experiments  these  dependences  have  been  neglected  in  both  the  calculations  [5] 
and  the  experiments  [6].  Therefore,  it  has  not  been  possible  to  understand  this  phenomena. 

Here,  we  have  measured  the  temperature  and  frequency  dependences  of  and  a  by  using  the 
infinitesimal  coaxial  probe  method  [7],  Figure  2  shows  and  a  vs.  temperature  at  50  and  100  MHz. 

As  can  be  seen  from  the  figure,  becomes  smaller  and  a  becomes  larger  as  the  temperature  rises. 
These  dependences  affect  the  electromagnetic  and  temperature  distributions  greatly,  which  is  shown  in 
section  IV(c)-(d).  From  the  same  figures,  frequency  dependences  of  £,  and  a  are  found  to  be  small 
within  50-100  MHz.  Because  the  lowest  resonant  frequency  does  not  change  during  the  heating 


343 


process,  it  is  not  necessary  to  consider  frequency  dependences  of  and  o.  Temperature  dependences 
of  thermal  conductivity  X,  specific  heat  capacity  c  and  the  volume  density  of  mass  p  were  measured, 
but  we  have  found  by  calculations  that  these  dependences  can  be  neglected.  Hereafter,  only  temperature 
dependences  of  and  a  are  taken  into  account. 

IV  Validation  of  Calculations 


(a)  Calculating  methods 


We  have  used  the  three-dimensional  Finite  Difference  Time  Domain  (3-D  FD-TD)  method  [3] 
to  solve  Maxwell's  equations: 

,dlL.  _ (i) 


VX£’=- 


5 


First,  we  derive  the  resonant  frequencies  of  the  cavity,  the  lowest  resonant  frequency  is  then  applied. 
After  analyzing  the  electromagnetic  field  distributions,  the  electromagnetic  energy  applied  to  the 
phantom  is  evaluated  as 

(2) 


q=1g 


/|E|" 


Heat  transfer  equations 


,ax^  dz^ ) 
q=ajT-Tc) 


(3) 

(4) 


are  solved  by  the  three-dimensional  Finite  Element  Method  (3-D  FEM),  where  p,  c,  X,  q,  and  T^ 
are,  respectively  volume  density  of  mass,  specific  heat  capacity,  thermal  conductivity,  heat  flux,  heat 
transfer  coefficient  and  the  external  temperature. 

Note  here  that  the  reason  we  have  used  FD-TD  method  to  solve  Maxwell’s  equations  is  the 
limitation  of  computer  memory  at  our  facility  (32  MB  RAM). 


(b)  The  model  used  for  calculations  and  measurements 


The  dimensions  of  the  cavity  used  for  calculations  and  measurements  are  shown  in  Fig.  3.  The 
diameter  and  the  height  of  the  body  are  1900  mm  and  1450  mm,  respectively.  Inside  the  body  there  are 
two  reentrants,  both  of  them  are  600  mm  in  diameter  and  500  mm  in  length.  The  phantom  we  have 
used  in  this  section  is  240  mm  in  diameter  and  250  mm  in  thickness.  The  calculation  model  is  divided 
into  FD  cells  by  using  an  automatic  mesh  generator  [10].  Variable  meshes  are  used  to  save  computing 
time  and  memory  while  maintaining  the  accuracy.  We  have  calculated  by  changing  the  cell  sizes  and 

found  that  the  cell  size  should  be  smaller  than  5  %  of  the  wave  length  in  the  phantom. 


(c)  Temperature  rises  for  phantoms  with  different  permittivities  and  conductivities 

To  understand  the  differences  of  electromagnetic  and  temperature  distributions  for  phantoms 
with  various  permittivities  and  conductivities,  both  calculations  and  measurements  are  carried  out  and 

compared.  Here,  the  external  temperature  was  set  to  20  °C.  Figures  4(a)  and  (b)  show  the  electromagnetic 
energy  and  temperature  distributions  on  the  x-y  plane,  respectively.  In  these  calculations,  relative 
permittivity  and  conductivity  are  set  to  constant  =  80  and  a  =  0.50,  which  correspond  to  the  initial 
temperature  of  10  °C.  As  can  be  seen  from  the  figures  electromagnetic  energy  is  concentrated  and  a 
temperature  rise  in  a  deep  region  is  obtained.  On  the  other  hand,  for  the  phantom  with  =  72  and  a  = 

0.74  which  correspond  to  the  initial  temperature  of  30  °C,  the  highest  temperature  region  is  found  in 
the  periphery  as  shown  in  Fig.  5(b).  From  these  results,  it  is  indispensable  to  consider  the  nonlinearity 
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Fig.2  Measured  relative  permittivity  and  electric  conductivity. 


dimensions  in  mm 

Fig. 3  Model  used  for  calculations  and  mesurements. 
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(a)  electromagnetic  energy  distribution  (b)  temperature  distribution 


Fig.4  Electromagnetic  energy  and  temperature  distributions  for  =80  and  a  =  0.50, 
which  correspond  to  the  initial  temperature  of  the  phantom  10  °C  . 


(a)  elccu-omagnetic  energy  disnibution  (b)  temperature  distribution 


Fig.5  Electromagnetic  energy  and  temperature  distributions  for  =72  and  o  =  0.74, 
which  correspond  to  the  initial  temperature  of  the  phantom  30  ®C  . 


Fig. 6  An  algorithm  to  consider  the  temperature  dependences  of  and  a. 

of  and  a.  Therefore,  we  propose  a  temperature  calculating  algorithm  considering  the  temperature 
dependences  of  permittivity  and  conductivity  as  shown  in  Fig.6. 

(d)  Comparisons  of  calculated  and  measured  results 

Figures  7(a)  and  (b)  compare  the  calculated  and  measured  results  on  the  x-y  plane,  where  the 
initial  temperature  of  the  phantom  and  the  external  temperature  were  9.2  and  21.7  °C,  respectively.  In 
calculating  the  temperature  distributions,  the  algorithm  shown  in  Fig.  6  is  used.  By  experimenting  with 
an  array  of  different  time  intervals,  it  was  found  that  updating  the  electric  constants  every  five  minutes 
was  sufficient  for  this  case.  In  Figs.  8(a)  and  (b)  the  calculated  and  measured  results  are  compared  at 

the  initial  temperature  of  23.5  °C.  The  external  temperature  was  21.0  °C.  As  can  be  seen  from  the 
figures,  close  agreements  can  be  found.  These  show  the  validity  of  the  proposed  algorithm.  However, 
a  temperature  rise  in  a  deep  region  is  not  obtained  when  the  initial  temperature  was  23.5  °C.  In  the 
next  section  an  attempt  is  made  to  obtain  selective  heat  in  a  deep  region. 

V  Computations  of  Electromagnetic  and  temperature  Distributions  for  a  long  phantom 

In  this  section,  the  computations  of  electromagnetic  and  temperature  distributions  for  a  long 
phantom  are  carried  out.  Considering  muscle,  relative  permittivity  and  conductivity  were  set  to  75 

and  0.75,  respectively  [11].  The  initial  temperature  of  the  phantom  was  set  to  20.0  °C.  The  external 

temperature  was  also  set  to  20.0  °C.  Figure  9  shows  the  model  calculated,  in  this  case  the  phantom  is 
200  mm  in  diameter  and  1500  mm  in  length.  The  conductors  were  set  to  direct  the  electromagnetic 
energy  toward  a  specific  region.  To  better  understand  the  effect  of  the  conductors  [8],  calculations 
were  carried  out  twice,  once  with  and  once  without  the  conductors. 

Figure  10  shows  the  electromagnetic  energy  distribution  on  the  x-y  plane  without  the  conductors. 
The  energy  is  directed  toward  periphery  and  we  can  understand  that  the  directed  heat  is  impossible. 
The  electromagnetic  energy  distribution  on  the  x-y  plane  with  the  conductors  are  shown  in  Fig.  11(a). 
As  can  be  seen  from  the  figure,  it  is  possible  to  direct  the  electromagnetic  energy  toward  a  deep 
region.  Selective  heat  in  a  deep  region  is  found  in  Fig.  1 1(b).  When  comparing  Figs.  10  and  1 1(a),  it 
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can  be  seen  that  when  the  conductors  are  utilized  an  obvious  improvement  in  directed  energy  toward  a 
specific  region  is  obtained  and  therefore,  the  problem  noted  in  section  IV(d)  may  be  solved.  Note  here 
that  the  roles  of  the  conductors  are  1)  shielding  the  field  and  2)  directing  the  field  by  changing  the 
electromagnetic  energy  distribution.  The  energy  distribution  is  found  to  be  critical  to  the  diameters  and 
lengths  of  the  conductors. 

This  technique  using  cylindrical  conductors  can  be  applied  to  the  phantoms  with  various 
shapes  and  electric  constants.  This  leads  to  the  possibility  of  the  reentrant  cavity  applicator  with 
cylindrical  conductors  being  used  for  a  deep-seated  cancer  tumors. 


(a)  calculated  temperature  distribution 


(b)  measured  temperature  distribution 


Fig,7  Comparison  of  calculated  and  measured  temperature  distributions, 
where  the  initial  temperature  of  the  phantom  was  9.2  °C. 


(a)  calculated  temperature  distribution  (b)  measured  temperature  distribution 


Fig.8  Comparison  of  calculated  and  measured  temperature  distributions, 
where  the  initial  temperature  of  the  phantom  was  23.5  ®C, 
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VI  Conclusion 


The  electric  constants  have  been  measured  by  using  the  infinitesimal  coaxial  probe  method.  It 
is  confirmed  that  it  is  indispensable  to  consider  the  temperature  dependences  of  relative  permittivity  £, 

and  electric  conductivity  a  to  understand  the  noted  phenomena.  We  have  proposed  an  algorithm  with 
nonlinearities  of  electric  constants  that  are  taken  into  account.  The  calculated  and  measured  results  are 
compared  for  a  disk- shaped  phantom  and  it  is  found  that  our  algorithm  is  valid.  By  using  the 
cylindrical  conductors  that  have  been  attached  to  the  reentrant  resonant  cavity  applicator,  electromagnetic 
energy  is  directed  toward  a  specific  part  of  the  long  phantom  and  good  temperature  rise  has  been 
obtained.  These  lead  to  the  possibility  of  the  reentrant  cavity  applicator  with  cylindrical  conductors 
being  used  for  a  deep-seated  cancer  tumors. 
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Abstract  Aim  of  this  paper  is  to  apply  an  error  based  finite  element  formulation  to  the  analysis 

of  the  resonant  cavities.  By  suitably  choosing  two  vector  potentials  for  the  electric  and  magnetic 
fields,  the  Maxwell  equations  as  well  as  initial  and  boundary  conditions  are  automatically  satined  for 
arbitrary  values  of  these  potentials,  being  the  error  concentrated  in  the  constitutive  equations.  The 
solution  is  then  obtained  via  minimization  of  a  global  error  functional  related  to  the  constitutive 
equations.  The  method  provides  two  complementary  solutions  and  an  error  estimation  that  can  be 
ejfectively  used  for  the  refinement  of  the  discretization.  The  solution  is  represented  by  means  of  edge 
element  based  shape  functions  and  the  evolution  of  the  system  is  analysed  in  the  time  domain.  To 
calculate  the  resonant  frequencies,  the  dynamics  of  suitable  field  components  at  proper  points  of  the 

cavity  are  Fourier  analysed.  In  addition,  by  interpreting  the  solution  at  time  t=At  as  the  frequency 

response  of  the  system  when  forced  by  a  sinusoidal  input  with  (0=1! At,  the  eigenfrequencies  are 
located  at  the  peaks  of  the  electromagnetic  energy.  Since  the  numerical  procedure  rigorously  satisfies 
Maxwell  equations,  the  well  known  problem  of  the  appearance  of  the  spurious  modes  does  not  arise. 
The  method  is  applied  to  the  analysis  of  inhomogeneously  loaded  3D  cavities. 

1.  Introduction 

This  paper  deals  with  the  analysis  of  3D  resonant  cavities.  We  apply  a  numerical  formulation  for 
the  solution  of  Maxwell  equations  which  is  based  on  the  minimization  of  the  so  called  constitutive 
error  in  the  time  domain.  Usual  methods  exactly  enforce  the  constitutive  relationships  and  only  one  of 
the  field  equations,  satisfying  the  other  one  in  a  weak  form.  On  the  other  hand,  the  error-based 
method  makes  use  of  the  vector  potentials  A  and  F  able  to  automatically  verify  both  field  equations. 
The  numerical  solution  is  then  obtained  by  minimising  a  functional  related  to  the  error  on  the 
constitutive  relationships  and  adopting  edge  element  based  functions  for  the  vector  potentials.  This 
formulation  provides  two  complementary  solutions  which  can  be  used  to  obtain  an  indication  of  the 
distribution  of  the  error  in  the  solution  domain. 

These  ideas,  originally  proposed  by  Rikabi  et  al.  in  [1],  have  been  discussed  and  applied  in  the 
time  domain  by  the  authors  [2-4]  and  in  frequency  domain  by  Bossavit  et  al.  [5-6]. 

Some  features  of  the  method  will  be  illustrated  by  determining  the  resonant  frequencies  of 
inhomogeneously  loaded  3D  cavities.  In  particular,  to  calculate  the  eigenfrequencies,  the  dynamics  of 
suitable  field  components  at  proper  points  of  the  cavity  are  Fourier  analysed.  In  addition,  by 
interpreting  the  solution  at  time  t=At  as  the  frequency  response  of  the  system  when  forced  by  a 
sinusoidal  input  with  co=l/At,  the  resonant  frequencies  are  located  at  the  peaks  of  the  electromagnetic 
energy. 

2.  The  error  based  approach. 

A  detailed  description  of  the  error  based  approach  to  the  solution  of  full  Maxwell  equations  is 
given  in  [2].  Here,  for  the  sake  of  simplicity,  we  will  focus  our  attention  to  the  model  of  Maxwell 
equations  in  non  conducting  media: 
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VxE  =  -aB/3t 
VxH  =  3D/at 


(1) 

(2) 


with  the  proper  boundary  and  initial  conditions  and  the  monotonic  constitutive  properties 

B  =  f3(H)  D  =  f^(E)  (3) 

Introducing  the  three  components  vector  potentials  A  e  F  as  the  time  integrals  of  -E  and  H,  the 
following  fields: 

E=-dA/du  H=aF/at,  B=B +VxA,  D=D +VxF  (4) 

automatically  satisfy  Eqs.  1  and  2  and  the  initial  conditions  B(x,0)=B^(x)  and  D(x,0)=D^(x). 

From  the  numerical  point  of  view  [2]  the  vector  potentials  A(x,t)  and  F(x,t)  are  approximated  in 
space  using  edge-element  based  functions  in  such  a  way  that  the  numerical  unknowns  become  the 

degrees  of  freedom  A(t)  and  F(t).  A  local  error  functional  X(x,t)>0,  with  strict  inequality  for  vector 
potentials  which  do  not  satisfy  Eqs  3,  is  then  defined  as: 

H  B  D  E 

?l(x,t)  =  0Cjj(  J  fg(h)‘dh  +  J  fjj(b)-db  +  H*'B*  -  H-B)  -h  ag(  J  fE(d)*dd  +  J  fp(e)  de 

H*  B*  D*  E* 

+  E*-D*  -  E-D)  (5) 

where  the  local  values  of  the  fields  are  given  by  Eq.  4;  and  are  the  inverse  mappings  of  fg  and 
(H*,  B*)  and  (E*,  D*)  are  two  pairs  which  satisfy  Eqs.  3;  a„and  are  weighting  factors.  In 
the  linear  case  (B=p.H,  D=8E,  H*=0,  B*=0,  E*=0  and  D*=0)  the  local  error  density  defined  by 
Eq.  5  becomes 

X  =  aH{^H2/2  +By2\i  -  B  H  }  +  ag{£E2/2  +D2/2£  -  D  E  }  (6) 

Linear  approximation  is  assumed  in  each  of  the  time  steps  in  which  the  global  interval  (0,T)  is 
partitioned,  bi  this  way  the  problem  reduces  to  the  solution  of  a  number  of  subproblems  defined  in 
the  time  steps  in  which  the  only  unknowns  are  the  vector  potentials  at  final  time.  Each  subproblem 

can  be  solved  by  minimizing  a  global  error  functional  related  to  X(x,  t)  as: 

A  =  J*'*'  J  X(x,  t)  dvdt  (7) 

where  V  is  the  domain  of  integration,  (tj.  ,ti.+i)  is  the  k-th  time  step. 

With  reference  for  instance  to  the  linear  case,  the  following  set  of  linear  algebraic  equations  is 
obtained  under  minimization: 


k+1 

The  time  integral  X(x,t)dt  is  approximated,  according  to  the  0  method,  as  X,(x,  te)At  with 

At=tk+i-tk  and  te=  6tk+i+(l-0)tk.  Moreover,  the  choice  a„=aE  assures  the  splitting  of  system  (9), 
i.e.  the  two  unknown  potentials  Ak+i= A(tk+i)  and  Fk+i=F(tk+i)  can  be  obtained  as  the  solution  of 


Ak+l 

aA/aA 

.^k+1. 

aA/apJo 
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two  independent  systems  of  equations,  since  in  this  case 
[a^A/aAk+iaFk+i]  =  [3^A/3Fk+iaAk+i]^  =  [0].  TOs  splitting  is  however  different  from  solving 

two  separate  problems  for  A  and  F  in  (0,  T)  because  in  our  case  each  potential  at  tk+i  depends  upon 
both  potentials  at  t^.  We  notice  that  the  choice  0=0  leads  to  a  classic  explicit  scheme  which  is  affected 
by  numerical  instabilities.  In  this  case,  A  and  F  are  not  updated  simultaneously,  but  Ak+i  is 

conventionally  replaced  by  Ak+1/2  ^rid  Ak  by  Ak-i/2«  Moreover,  since  Ak+1/2  and  Fk+i  are  not 
explicitly  available  the  dynamic  matrix  can  be  diagonalized  by  using  the  trapezoidal  rule  for  the 
numerical  evaluation  of  the  relevant  space  integrals,  a  treatment  similar  to  the  mass  lumping  [7]. 

The  main  difference  with  the  approach  of  ref.  [5]  is  that  the  error  functional,  being  expressed  in 
the  time  domain,  is  a  positive  definite  quantity,  leading  to  the  splitting  as  a  result  of  a  minimization 
procedure. 

3.  Numerical  Results 

The  method  is  applied  to  the  analysis  of  3D  resonant  cavities.  The  problem  has  been  solved  in  the 

time  domain  starting  from  the  initial  conditions  D(x,  0)=Do(x)=0  and  B(x,  0)=Bo(x)  =  VxAq  at 
time  t=0.  The  vector  potential  Aq  has  been  selected  equal  to  a  linear  combination  (with  random 
coefficients)  of  all  edge  element  based  shape  functions,  having  excluded  those  corresponding  to 
edges  on  the  electric  walls.  The  resonant  jfrequencies  are  then  obtained  performing  a  Fourier  analysis 
of  the  numerical  results.  This  method  is  particularly  useful  when  dealing  with  walls  of  finite 
conductivity. 

On  the  other  hand,  in  case  of  a  lossless  medium,  if  the  potentials  A  and  F  (as  well  as  the  fields  E 
and  H)  are  represented  in  the  frequency  domain,  one  of  them  will  be  real  and  the  other  one  pure 
imaginary.  This  allows  to  obtain  a  solution  of  the  field  problem  in  the  frequency  domain  using  the 
results  of  the  code  running  in  the  time  domain.  In  fact,  using  an  implicit  scheme.  Maxwell  equations 
at  time  At  are  approximated  as 


VxE(x,  At)  =  |i 
V  X  H(x,  At)  =  8 


H(x,  At) 
At 

E(x,  At) 


(9) 

(10) 


with  |i  =  -jio'  If  we  compare  these  equations  with  the  corresponding  ones  in  the  frequency  domain 


VxE(x,  co)=-jco|LioH(x,  CO)  (11) 

VxH(x,  o))=jcoEE(x,  co)  (12) 


and  we  pose  H(x,  co)=jh(x,  co)  and  E(x,  co)=e(x,  00),  we  notice  that  can  be  established  the  following 
correspondence: 


co=l/At,  A  =  H(x,  At)=h(x,  co),  E(x,  At)=e(x,  co)  (13) 

This  means  that  with  these  definitions  the  solution  of  the  system  (8)  at  time  t=At  corresponds  to  the 
sinusoidal  solution  in  the  frequency  domain  at  the  angular  fr^uency  co=l/At  The  frequency  response 
can  be  obtained  [8]  by  forcing  the  system  with  a  sinusoidal  input;  the  eigenfrequencies  correspond  to 
the  peaks  of  the  reactive  power  that,  according  to  Foster's  reactance  theorem,  is  an  increasing 
function  of  the  angular  frequency. 
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Notice  that  the  formulation  here  illustrated  is  free  of  spurious  modes  since  Maxwell  equations  are 
rigorously  satisfied  In  particular  the  fields  D  and  B  are  automatically  divergence-free.  We  refer  to 
[5]  for  a  deeper  discussion  of  this  point. 

Though  the  proposed  method  is  fully  3D  and  then  applicable  to  problems  of  arbitrary  geometry, 
the  first  example  is  related  to  the  analysis  of  the  simple  cubic  resonator  as  an  attempt  to  validate  the 
proposed  procedure  and  to  estimate  its  accuracy.  The  3D  resonant  frequencies  of  the  cube  are  given 

by  fimn  -  c#  -I-  m^  +  n^  /za  ,  where  a  is  the  length  of  the  edge  of  the  cube  (a=l  m  in  our  case). 
TTie  numerical  analysis  has  been  performed  in  the  time  domain  using  8192  time  steps  of  50  ps,  with  a 

Courant  number  (calculated  on  the  finer  mesh)  of  Cou=c — V3  =0.62.  Three  meshes  (4x4x4, 

Ax 

8x8x8, 12x12x12  equally  spaced  hexahedral  elements)  of  one  eight  of  the  cube  have  been  analysed. 
With  the  finer  mesh  the  fundamental  mode  is  discretiz^  with  48  elements  per  wavelength,  while  the 
last  depicted  mode  (1=1,  m=5,  n=0)  has  still  a  sufficient  number  of  elements  per  wavelength.  The 
modes  with  same  frequency  and  different  shape  (the  "degenerate  modes")  have  been  found  by  a 
proper  handling  of  the  symmetry  conditions  on  the  3  symmetry  planes  (which  have  been  considered 
as  either  electric  or  magnetic  walls)  The  results  of  the  Fourier  analysis  are  reported  in  Table  I  showing 
a  good  agreement  with  the  analytic  solution.  Notice  that  there  is  no  evidence  of  spectrum  pollution, 
i.e.  spurious  modes  do  not  appear. 


Mode 

Analytical 

Numerical 

f[MHz] 

Coarse  mesh 

Medium  mesh 

Fine  mesh 

f[MHz] 

f[MHz] 

f[MHz] 

no 

211.979 

215 

212 

212 

111 

259.621 

263 

261 

261 

012 

335.169 
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339 

337 

112 

367.159 

381 

371 

369 

220 

423.959 

446 

430 

425 

221 

449.676 

471 

454 

452 

130 

474.000 

532 

486 

478 

113 

497.135 

551 

507 

500 

222 

519.241 

546 

528 

520 

203 

540.443 

598 

552 

544 

132 

560.844 

618 

571 

564 

223 

618.021 

676 

630 

623 

401 

618.021 

774 

647 

630 

330 

635.938 

718 

652 

642 

114 

635.938 

784 

664 

647 

331 

653.364 

734 

668 

659 

420 

670.337 

878 

703 

681 

421 

686.891 

910 

715 

698 

332 

703.056 

925 

718 

708 

224 

734.417 

>1000 

762 

745 

403 

749,460 

964 

779 

762 

134 

764.302 

>1000 

793 

774 

150 

764.302 

>1000 

825 

786 

333 

778.862 

878 

798 

783 

423 

807.193 

>1000 

835 

818 

Table  I:  Homogeneous  cubic  resonant  cavity.  Fourier  analysis  for  three  different  discretizations.  The  values  refer  to 
Ey  or  Ez  at  x=f).375  m,  y=0.375  m,  z=0.375  m. 

The  second  example  refers  to  a  rectangular  cavity  loaded  with  a  dielectric  slab  as  shown  in  fig.  1. 
One  eight  of  the  domain  has  been  discretized  by  imposing  nxH=0  at  x=a/2  and  z=c/2,  nxE=0  at 
y=b/2.  Two  meshes  have  been  considered:  a  coarse  mesh  with  10x4x4  equally  spaced  hexahedral 
elements  and  a  fine  mesh  with  20x8x8  equally  spaced  hexahedral  elements.  The  analysis  has  been 
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carried  out  again  in  the  time  domain  (8192  time  steps  of  50  ps).  In  fig.  2  the  spectra  of  obtained 
by  a  Fourier  analysis  of  the  numerical  results  calculated  inside  the  dielectric  at  the  point  Pi  (x=0.250 
m,  y=0.075  m,  z=0.100)  in  case  of  the  coarse  mesh  and  at  the  point  P2  (x=0.050  m,  y=0.075  m, 
z=<).250  m)  in  case  of  the  fine  mesh  are  shown.  The  resonance  frequencies  of  the  main  modes  are 
reported  in  Table  n  in  comparison  with  the  analytic  solutions. 


Figure  1:  Geometry  of  the  rectangular  cavity  loaded  with  a  dielectric  slab.  In  the  numerical  example  a- 1  m,  b=.3m,  c 

=.4m,  d=.4m,  t=.2m,  Ef  =16. 


Figure  2:  Rectangular  cavity  loaded  with  a  dielectric  slab.  Spectra  of  Ey=-3Ay/8t  at  points  PI  (coarse  mesh)  and  P2 

(fine  mesh). 

The  electromagnetic  field  distribution  for  a  given  mode  can  also  be  reconstructed.  The 
eigenfunction  of  an  arbitrary  quantity  q  corresponding  to  the  resonant  frequency  fi  can  be  computed 
by 

qi(x)  =  A(x,  fi)  cosd)(x,  fi)  (14) 

where  A(x,  f)  and  <I>(x,  f)  are  amplitude  and  phase  given  by  the  Fourier  analysis  of  q(x,t)  at 
frequency  f.  For  example,  fig.  3  shows  the  Ey  distributions  in  modes  TEioi  and  TE301  as  a  function 
of  X  for  different  discretizations,  as  obtained  from  the  A  solution  (E(A)=-“3A/3t)  and  from  the  F 
solution  (E(F)=VxF/e).  From  these  results  it  clearly  appears  the  role  of  the  constitutive  error  in  the 
estimation  of  the  correctness  of  the  numerical  solution. 
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In  fig.  4,  the  electric  field  estimates  Ey(A)  and  Ey(F)  are  shown  as  functions  of  the  time  at  point  P2 
of  the  fine  mesh  inside  the  dielectric.  It  can  be  seen  that  the  constitutive  error  is  much  higher  than  in 
the  mode  reconstruction  (see  fig,  3),  The  main  reason  is  that  the  random  initial  conditions  excite  the 
modes  whose  wavelenght  is  comparable  with  the  mesh  size. 


Figure  3:  Rectangular  cavity  loaded  with  a  dielectric  slab.  Electric  field  estimates  Ey(A)=-^Ay/9t  and 

Ey(F)=(dFx/3z-dFz/3x)/e  and  error  field  Ey(A)-Ey(F)  as  functions  of  x  at  y=0.075  m,  z=0.100  m:  a)  TEioi 
eigenfunction  (coarse  mesh);  b)  TEtoi  eigenfunction  (fine  mesh)  c)  TE301  eigenfunction  (fine  mesh). 

Furthermore,  we  report  some  results  of  the  analysis  based  on  the  correspondence  described  by 
Eqs  13.  The  cavity  is  excited  with  an  arbitrary  sinusoidal  input  of  fixed  amplitude.In  fig.  5,  the 
reactive  power  is  shown  as  a  function  of  the  frequency  of  the  sinusoidal  input.  In  Table  HI,  the  first 
resonant  frequencies  for  the  coarse  and  the  fine  mesh,  as  given  by  the  location  of  the  energy  peaks 
computed  using  the  A  and  F  variables,  respectively.  It  can  be  seen  that,  in  this  case  no  upper  and 
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lower  bounds  are  obtained.  We  notice  that  our  analytical  and  numerical  values  of  the  resonant 
frequency  of  the  ftindamental  mode  is  slightly  different  from  the  one  reported  in  [9, 10]. 

The  same  analysis  has  been  performed  on  the  inhomogeneously  loaded  cavity  (0<x<0.200m, 

0<y<0.500m,  0<z<0.300m)  described  and  studied  f.i.  in  [5,1 1,12],  The  relative  permittivity  is  er=16 
in  the  dielectric  region  (0<x<0.075m,  0<y<0.125m,  0<z<0.175m)  and  er=l  in  the  rest  of  the 
domain.  Magnetic  walls  are  placed  at  x=0  and  y=0  whereas  the  other  walls  are  perfectly  conducting. 
Some  preliminary  results  obtained  in  the  time  domain  were  already  presented  in  [4].  Here,  we  present 
in  Table  IV  the  value  of  the  resonant  frequency  of  the  fundamental  mode  as  obtained  by  the  first  peak 
of  the  reactive  power  computed  via  Eq.  13  with  the  fine  mesh  used  in  [4]. 


4.50E-7  5.00E-7  5.50E-7  S.OOE-7 
Time  [sec] 

Figure  4:  Rectangular  cavity  loaded  with  a  dielectric  slab;  Ey(A.)  (solid  line)  and  Ey(F)  (dashed  line)  as  functions  of 
the  time  at  point  P2  of  the  fine  mesh  inside  the  dielectric. 


Frequency  [Hz] 

Figure  5:  Rectangular  cavity  loaded  with  a  dielectric  slab:  reactive  power  (in  arbitrary  units)  as  a  function  of  the 

frequency  of  the  sinusoidal  excitation. 


Mode 

Analytical 

Numerical 

f[MH2] 

Coarse  mesh 

Fine  mesh 

f(MHz] 

f[MHz] 

101 

130,972 

132 

132 

121 

297.243 

313 

300 

103 

310.873 

>450 

320 

301 

392.282 

>450 

410 

Table  II:  Rectangular  cavity  loaded  with  a  dielectric  slab.  Fourier  analysis  of  the  estimate  of  Ey  given  by  the  A 
potential  (Ey--8Ay/9t)  at  PI  (coarse  mesh)  and  P2  (fine  mesh). 
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Mode 

Analytical 

Numerical 

f[MHz] 

Coarse  mesh: 

f  [MHz] 

Fine  mesh:  f  [MHz] 

A  variables 

F  variables 

A  variables 

F  variables 

101 

130.972 

132 

131 

131 

131 

121 

297.243 

305 

303 

299 

299 

103 

310.873 

328 

326 

315 

315 

301 

392.282 

420 

418 

401 

400 

Table  III:  Rectangular  cavity  loaded  with  a  dielectric  slab.  The  first  resonant  frequencies  for  the  coarse  and  the  fine 
mesh,  as  given  by  the  location  of  the  peaks  of  the  reactive  power,  computed  using  the  A  and  F  variables 

Formulation  flMHz] 

A-F  Time  domain  259 

A  (Reactive  power  peaks)  256 

F  (Reactive  power  peaks)  258 

E  (Ref.  [6])  254.9 

H(Ref.[6])  254.5 

H(Ref.[9])  267 

A-V(Ref.  [10])  258 

F-V(Ref.  [10])  259 

Table  IV:  Inhomogeneously  loaded  3D  cavity.  Values  of  the  fundamental  resonant  firequency  given  by  the  Fourier 
analysis  and  by  the  first  peak  of  the  reactive  power,  compared  with  the  results  reported  in  of  [6,9,10]. 

4.  Conclusions 

In  this  paper,  a  numerical  formulation  of  3D  Maxwell  equations  in  the  time  domain  based  on 
the  minimization  of  the  constitutive  error  has  been  applied  to  the  analysis  of  resonant  cavities.  The 
method  present  several  advantages  both  in  comparison  with  finite  difference  schemes  on  structured 
meshes,  both  in  comparison  with  finite  element  formulations  applied  to  the  solution  of  the  vector 
wave  equation  on  unstructured  meshes.  In  fact,  it  combines  the  positive  features  of  the  first,  in 
particular  the  possibility  of  having  stable  explicit  time  domain  schemes,  with  the  typical  good  features 
of  the  second  with  respect  to  the  effective  use  of  unstructured  meshes.  Moreover,  the  exact 
verification  of  Maxwell  equations  is  guarantied,  being  the  error  concentrated  in  the  constitutive 
equations,  henceforth  no  spurious  modes  arise;  the  solution  is  obtained  via  minimization  of  a  global 
error  functional  and  an  error  estimation  is  readily  available.  For  the  study  of  3D  resonant  cavities  the 
results  produced  using  Foster's  reactance  theorem  are  more  accurate  than  those  obtained  via  Fourier 
analysis. 
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Abstract 

Hiis  paper  describes  a  system  capable  of  taking  an  initial  set  of  design  specifications  and 
creating  and  optimizing  a  complete  device.  Two  basic  processes  are  used  coupled  with  numerically 
based  analysis  tools.  These  processes  are  those  of  case  based  reasoning  and  sensitivity  analysis.  The 
former  allows  the  determination  of  an  initial  design  choice  relatively  rapidly  while  the  second  provides 
a  basis  for  the  use  of  first  order  optimization  algorithms.  The  total  system  also  provides  a  learning 
environment  in  which  the  system  gradually  becomes  better  at  designing  devices  of  a  particular  class. 


Introduction 


The  complete  design  of  an  electromagnetic  device,  as  with  any  other  structure,  requires  an 
iterative  loop  of  synthesis  and  analysis.  This  is  often  described  as  a  hierarchical  process,  an  iterative 
process,  a  waterfall  model,  or  in  many  other  ways;  but  the  essence  is  that  design  starts  with  a 
requirement  for  a  device  to  fulfil  a  purpose  subject  to  a  set  of  constraints  and  this  forms  the  initial 
specification.  The  final  device  is  fully  described  in  terms  of  a  multitude  of  parameters  -  often 
numbering  several  thousand  -  and  the  design  ’process'  is  essentially  one  of  exploring  the  design  space 
described  by  these  parameters  subject  to  a  set  of  implicit  and  explicit  constraints.  In  order  to  make  this 
problem  tractable,  engineers  use  a  process  of  information  hiding  and  sub-problem  analysis  to  reduce 
the  dimensionality  of  the  search  space  such  that  an  exploration  of  the  total  design  space  can  be  guided 
in  some  way.  One  traditional  method  for  doing  this  is  to  examine  existing  devices,  find  one  that  is  near 
to  the  desired  specifications  and  modify  it.  The  modified  device  is  then  tested  and  remodified.  Thus 
design  becomes  an  evolutionary  process.  However,  building  and  testing  devices  is  an  expensive 
business  and  analytical  and  numerical  techniques  of  ever  more  increasing  complexity  provide  a  method 
for  predicting  how  the  changes  might  affect  the  performance  without  having  to  build  the  device.  Two- 
and  three-dimensional  numerical  analysis  techniques  represent  the  current  state  of  the  art  in  this  area 
and  can  almost  provide  a  simulated  laboratory  for  testing  designs  without  actually  building  them. 

Recently,  considerable  interest  has  been  focussed  on  optimization  processes  which  automatically 
modify  designs  to  meet  specified  performance  requirements  or  goals  [1],  [2],  [3],  [4].  These 
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optimization  techniques  break  into  two  classes  -  deterministic  methods  and  stochastic  approaches.  In 
the  former  category  are  techniques  which  require  knowledge  of  the  derivatives  of  the  cost  functions 
with  respect  to  the  design  parameters,  in  the  second  are  approaches  such  as  simulated  annealing  and 
genetic  algorithms  in  which  the  derivatives  of  the  cost  functions  are  determined  by  exploring  the  search 
space  by  solving  sets  of  examples. 

The  computer  aided  analysis  tools  which  currently  exist  for  general  purpose  two  and  three 
dimensional  analysis  are  almost  all  based  on  numerical  processes  and  allow  the  solution  of  one 
particular  design.  Much  as  in  the  real  laboratory,  derivatives  of  performance  variables  with  respect  to 
the  design  parameters  are  not  often  available  and  this  has  led  to  the  recent  growth  in  interest  in 
stochastic  systems.  However,  if  sensitivity  information  can  be  generated  in  a  form  which  will  allow 
first  order  deterministic  optimization  methods  to  be  applied,  it  is  possible  to  set  up  the  synthesis,  or 
inverse,  problem  and  to  solve  it  directly  [5].  With  numerically  based  systems,  this  process  can  be 
extremely  slow,  requiring  many  simulated  tests  of  the  device. 

An  alternate  approach  is  to  use  existing  experience  to  propose  an  initial  configuration  for  the 
device.  This  allows  the  final  optimization  process  to  concentrate  on  a  small  part  of  the  search  space 
with  the  corresponding  gain  in  speed.  Such  a  combined  approach  is  described  below.  The  knowledge 
based  approach  being  considered  is  that  of  Case  Based  Reasoning  [6]  which  matches  the  given 
specifications  onto  a  database  of  existing  designs.  Such  an  approach  provides  two  pieces  of 
information:  the  first  is  a  topology  for  the  device  which  will  meet  the  requirements  and  the  second  is 
an  initial  set  of  geometric  values  providing  a  starting  point  for  the  design  search.  A  new  design  is  then 
generated  based  on  a  combination  of  the  nearest  matches  found.  This  device  can  be  modified  in  shape 
by  using  sensitivity  information  to  allow  first  order  optimization  processes  to  be  applied. 


Identifying  an  Initial  Population  of  Devices 

All  electromagnetic  devices  operate  subject  to  Maxwell’s  Equations,  at  least  at  the  macroscopic 
level,  and  thus  at  the  highest  level  they  are  all  similar.  In  an  object  oriented  sense,  a  base  class  of 
electromagnetic  devices  can  be  described  which  have  certain  properties,  i.e.  they  are  all  built  out  of 
physical  materials,  they  are  all  subject  to  the  basic  electromagnetic  field  laws,  etc.  A  hierarchy  of 
classes  can  now  be  constructed  from  this  base  class,  some  of  these  classes  involve  the  coupling  of 
electric  and  magnetic  circuits  to  transform  energy,  leading  to  the  classes  of  transformers  and  electro¬ 
mechanical  devices.  The  latter  sub-class  includes  certain  extra  topological  requirements  such  that 
motion  can  take  place.  Indeed,  this  concept  of  inheritance  and  derived  classes  has  been  implicit  in  the 
traditional  methods  of  teaching  electromechanical  energy  conversion. 

Any  device  that  is  constructed  inherits  all  the  properties  from  all  its  parent  classes  (devices)  in 
this  class  hierarchy.  Each  physical  instantiation  of  a  class  can  be  regarded  as  a  case  and,  in  true  object 
oriented  fashion,  it  carries  with  it  all  the  ’knowledge’  of  how  to  function  in  its  environment.  Thus  a  set 
of  switched  reluctance  motors  is  made  up  of  several  instantiations  of  a  derived  class  of  switched 
reluctance  devices.  True  to  the  object  oriented  paradigm,  there  is  considerable  information  hiding 
happening  in  the  final  devices  -  in  other  words,  many  of  the  rules  and  laws  describing  the  device 
operation  are  implicit  and  can  only  be  determined  from  a  set  of  tests  on  the  device.  In  fact,  the 
separation  of  the  rules  of  the  device  operation  into  a  single  unique  set  cannot  necessarily  be  done  on 
a  single  device.  Instead,  a  whole  range  of  devices,  i.e.  cases,  will  be  needed. 

Thus,  the  knowledge  involved  in  designing  a  particular  kind  of  device  can  be  described  by 
showing  several  examples  of  the  device.  This  is  the  basis  of  Case  Based  Reasoning.  The  approach 
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avoids  the  need  to  store  all  the  design  rules  for  a  device;  instead,  it  uses  examples  or  instantiations  as 
a  method  of  implicitly  storing  knowledge. 

The  advantage  of  this  approach  is,  as  with  all  object  oriented  systems,  a  reduction  in  the 
dimensionality  of  the  space  needed  to  store  the  information.  The  reduction  in  dimensionality  is  traded 
off  for  storing  many  examples.  Thus,  as  the  number  of  examples  grows,  the  implicit  dimensionality 
grows  in  that  it  should  be  possible  to  determine  all  the  parameters  and  rules  governing  the  device 
explicitly.  However,  this  would  require  the  choice  of  exactly  the  correct  set  of  examples  and,  since  the 
correct  answer  is  not  known,  the  choice  of  the  correct  set  is  extremely  difficult  to  make.  In  an 
industrial  environment,  the  examples  are  limited  to  those  which  have  already  been  produced.  An 
approach  of  reducing  dimensionality  in  this  fashion  obviously  simplifies  the  data  storage  structures  but 
also  will  lead  to  storage  of  redundant  information.  This  latter  effect  is  usually  accepted  as  a  reasonable 
price  to  pay. 

Even  with  this  reduction  in  dimensionality,  the  number  of  parameters  to  be  stored  to  fully 
describe  a  complete  device  can  be  extremely  high,  often  running  into  the  thousands.  An  initial  design 
specification  cannot  be  that  detailed  -  if  it  was,  the  design  would  be  complete  and  there  would  be  no 
work  to  do!  The  problem,  then,  is  still  one  of  accessing  the  information  built  into  the  case  database 
starting  with  only  the  initial  design  specifications.  In  addition,  the  initial  specifications  may  consist  of 
any  one  of  many  subsets  of  the  total  parameter  set  or  may  be  expressed  in  terms  of  combinations  of 
the  basic  parameters,  i.e.  higher  level  functions.  For  a  case  based  reasoning  system  to  function 
effectively,  it  must  be  able  to  reason  from  the  specifications  provided  and  choose  a  set  of  cases  from 
its  database  which  most  closely,  in  some  sense,  match  the  specifications. 

These  cases,  in  effect,  provide  the  basis  for  completing  the  design  -  they  ail  contain  all  the 
parameters  needed  to  create  a  final  device.  The  choice  of  a  case  immediately  provides  values  for  all 
the  unknown  parameters,  in  effect  a  table  lookup  system  is  implemented. 

Once  an  initial  population  of  devices  has  been  found,  the  next  step  is  to  modify  these  devices 
to  better  fit  the  required  specifications.  This  may  be  done  in  several  ways,  including  a  genetic  or 
evolutionary  approach.  However,  the  method  used  here  tries  to  avoid  such  stochastic  systems.  Rather, 
an  initial  structure  for  the  desired  device  might  be  obtained  by  some  form  of  interpolation  within  the 
initial  population  or  just  the  nearest  match  might  be  chosen.  The  next  sections  describe  one  approach 
to  finding  this  initial  set. 


Feature  Extraction 


When  a  device  is  stored,  it  exists  in  a  multi-dimensional  space  but  not  all  these  dimensions,  or 
parameters,  are  useful  for  retrieving  it.  In  fact,  to  retrieve  it  based  on  a  small  set  of  specifications 
requires  that  some  amount  of  compaction  in  the  description  of  the  device  is  necessary.  The  problem 
is  to  match  the  provided  specifications  on  to  the  set  of  stored  parameters.  This  can  be  achieved  in  two 
steps.  The  first  step  is  that  of  feature  extraction  -  a  process  routinely  used  in  pattern  recognition  [7]. 
A  feature  is  a  parameter  of  a  device  which  best  describes  the  class  of  the  device,  i.e.  those  parameters 
which  most  affect  the  performance  of  the  device.  An  exhaustive  search  of  all  the  parameters  over  a 
set  of  devices  within  a  specific  class  is  too  expensive  so  the  methods  employed  are  usually  based  on 
a  heuristic  approach. 

A  major  requirement  in  extracting  the  key  parameters  is  that  they  should  be  linearly  independent 
and  orthogonal.  This  means  that  one  feature  at  a  time  can  be  considered  without  affecting  the  others. 
The  question  to  be  answered  by  any  criterion  which  is  estabhshed  is:  'Which  features  identify  this  class 
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as  distinct  from  any  other  class?'  This  leads  to  the  concept  of  a  separability  criterion  which  has  been 
more  fully  described  elsewhere  [8],  Such  a  criterion  allows  a  quantitative  measure  of  separability 
between  two  classes  to  be  established  for  each  feature  of  a  set  of  devices  and,  in  turn,  provides  a 
method  of  ranking  the  importance  of  each  feature  within  a  specific  class. 


The  Algebraic  Model 

The  extraction  of  important  features  of  a  class  is  not,  by  itself,  sufficient  to  provide  a  way  of 
matching  specifications  onto  the  database  of  cases  because  the  choice  of  the  feature  set  is  not  unique. 
However,  each  choice  can  be  converted  into  any  other  choice  by  suitable  transformations.  Such 
transformations  can  be  described  by  an  algebraic  model  of  the  device.  Many  such  models  have  been 
developed  by  designers  over  time  and  are  generally  used  to  determine  the  implications  of  certain  design 
choices.  A  set  of  algebraic  equations  basically  provides  a  mechanism  for  propagating  information  and 
deriving  (inferring)  more  information.  The  equations  can  be  described  in  terms  of  a  network  which 
provides  the  mechanism  for  moving  from  one  parameter  set  to  another.  This  is  the  second  step  in 
matching  stored  cases  to  the  provided  specifications. 

The  set  of  specifications  can  be  used  to  identify  the  class  of  device  which  is  to  be  used  to 
satisfy  the  design  requirements,  the  identification  of  the  class  activates  one  particular  algebraic  model 
and  this  allows  the  parameters  in  the  specifications  to  be  converted  into  the  parameters  used  to  store 
the  current  cases.  The  process  for  doing  this  has  been  implemented  as  an  Algebraic  Constraint  System 
[9]. 


Similarity 

At  this  point  in  the  process,  a  set  of  parameters  has  been  generated  which  can  be  matched  to 
the  stored  cases  and  the  final  requirement  is  to  determine  the  'closeness*  or  degree  of  similarity  between 
the  required  device  and  the  existing  designs.  This  similarity  measure  is,  again,  a  heurisiically  based 
function.  The  total  similarity  of  the  input  specification  to  the  retrieved  case  is  measured  by  the  strength 
of  the  similarity  of  each  parameter.  If  there  is  a  direct  match  between  two  parameters  then  the 
similarity  has  a  value  of  1.0.  The  function  used  is: 


Similarity= 


_ 1 _ 

l+ax|logpi-logp2| 


(1) 


where 

pi  =  stored  case  parameter  value 
p2  =  input  case  parameter  value 

a  -  an  adjustable  constants  whose  value  depends  on  the  application. 

After  each  parameter  is  processed  the  overall  similarity  value  is  calculated  by  the  Nth  root  of 
the  individual  similarity  shown  in  equation  (2). 
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1 


(2) 


TotalSimilarity^i 

1 


5-  =  the  similarity  of  the  ith  matched  parameter,  n  =  the  number  of  matched  parameters 

If  the  total  similarity  value  is  below  a  predetermined  threshold  the  match  is  abandoned.  Those 

cases  with  a  total  similarity  value  above  the  threshold  will  be  sorted  and  retrieved. 

At  this  point,  several  designs  have  been  retrieved  all  of  which  are  close  to  the  desired  device 

but  none  is  a  perfect  match.  A  final  possible  device  to  match  the  specifications  is  derived  by  means 

of  interpolation  between  the  retrieved  designs  or,  more  simply,  the  closest  match  may  be  used  as  a 
starting  point  for  an  optimization  process. 


Optimizing  the  Design 

While  the  above  process  will  have  generated  a  possible  design,  there  is  no  guarantee  that  it  will 
perform  correctly.  In  fact,  if  the  closest  existing  design  is  used  it  is  already  known  that  it  is  not  a 
perfect  match.  However,  the  required  design  will,  in  all  probability  be  somewhere  close  by  in  the 
search  space.  If  the  design  is  sufficiently  close,  the  chance  of  finding  a  globally  optimum  solution, 
rather  than  a  local  one  will  be  somewhat  increased.  To  complete  the  design,  it  is  necessary  to 
investigate  the  effects  of  parameter  variations  on  the  device  performance  in  somewhat  more  detail  and 
this  work  is  computation^ly  intensive  and  thus  time  consuming.  Usually,  the  variations  take  the  form 
of  shape  modifications  although  material  changes  might  be  considered  as  well. 

Two  questions  to  be  answered  at  this  point  in  the  design  are:  ’What  is  the  device  which  matches 
the  specifications  within  the  required  error  bounds?’  and  ’How  sensitive  is  the  performance  of  the 
device  to  small  variations  in  shape  and/or  material  properties?'  Both  these  questions  can  be  answered 
if  sensitivity  information  is  available.  The  first  question  can  be  addressed  by  using  the  sensitivity 
information  to  drive  the  optimization  process  in  those  directions  which  will  be  the  most  profitable  and 
the  second  question  can  be  answered  directly  by  the  various  sensitivities  at  the  optimum  design. 

Such  sensitivity  information  may  be  derived  directly  from  the  underlying  field  equations  and 
this  process  is  outlined  in  the  following  sections. 


Design  Sensitivity  Analysis 

This  section  describes  how  first  order  optimization  algorithms  can  be  applied  to  optimizing  the 
device  performance  using  Design  Sensitivity  Analysis.  Sensitivity  is  the  derivative  information 
required  by  first  order  optimization  methods  [10].  The  process  requires  that  the  design  specifications 
be  formulated  as  a  mathematical  objective  function  such  that  a  minimum  of  the  objective  function 
corresponds  to  a  valid  design.  The  objective  function  will  include  both  the  costs  associated  with  the 
design,  such  as  power  dissipation  or  amount  of  material,  as  well  as  constraints  in  the  design 
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requirements,  for  example  the  required  force  of  an  actuator. 

Since  the  objective  function  encodes  the  performance  of  the  magnetic  device,  it  will  be  a 
function  of  one  or  both  of  the  electric  and  magnetic  fields.  But  what  is  required  is  the  derivative  of 
the  objective  function  with  respect  to  the  design  variables.  The  fields  depend  implicitly  on  the  design 
variables  through  Maxwell's  equations.  Design  Sensitivity  Analysis  effectively  provides  the  means  to 
compute  the  derivative  of  this  implicit  dependence. 


Adjoint  System 


The  sensitivity  analysis  starts  with  an  adjoint  system  which  has  the  same  material  configuration 
as  the  original  system  (system  A),  but  the  sources  (i.e.  source  current  density)  of  the  adjoint  system 
(system  B)  are  chosen  as  follows: 


/,B= 

4b  = 


(3) 


where  is  the  magnetic  source  current  density,  and  the  gradients  are  the  partial  derivatives  of  the 
objective  function  with  respect  to  the  field  components  at  a  point  (see  [11]  for  details). 


Shape  Sensitivity 

Once  the  fields  for  the  adjoint  system  are  computed  (by  solving  Maxwell's  equations),  the 
sensitivity  of  the  objective  function  F  to  variations  in  shape  can  be  readily  computed  using  the 
following  equation: 

5F  = 

-  JO)  +  E^<jsM-FsBb»^^ 

where  6F  is  the  perturbation  in  the  objective  function  due  to  a  displacement  6s  of  the  surface  in  the 
direction  of  the  surface  normal.  In  this  equation,  a^,  and  \x^  are  the  conductivity,  permittivity  and 
permeability  of  the  region  inside  the  surface,  while  the  same  quantities  with  a  b  subscript  denote  the 
material  properties  of  the  region  outside  the  surface.  Also  source  current  density  in  the 

region  inside  the  surface,  and  is  the  source  current  density  outside  the  surface.  Note  that 
continuity  requires  that  the  normal  component  of  the  source  current  density  must  be  equal  on  both  sides 
of  the  surface.  Likewise,  since  the  fields  may  be  discontinuous  across  the  interface,  the  same 
subscripts  are  used  to  distinguish  fields  on  the  inside  of  the  surface  (a)  from  fields  on  the  outside  of 

the  surface  (b).  Thus,  for  example,  is  the  magnetic  field  of  the  adjoint  system  just  outside  the 
surface.  The  total  perturbation  in  the  objective  function  is  found  by  integrating  (4)  over  the  surface 
of  interest.  The  derivative  of  the  objective  function  with  respect  a  design  parameter  which  controls 
the  position  of  the  surface  is  computed  using  the  chain  rule  after  computing  the  derivative  of  65  at 
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every  point  on  the  surface. 

Design  parameters  may  also  control  other  device  properties  besides  shape,  for  example  material 
properties  and  current  distributions.  The  sensitivity  to  these  design  variables  may  also  be  computed 
using  the  fields  of  the  same  adjoint  system,  [11]  contains  the  details  for  this  case. 


Interface  with  Case  Based  Reasoning 

It  should  be  obvious,  from  the  above  summary  of  sensitivity  calculations,  that  setting  up  the 
optimization  process  is  not  a  simple  task.  Even  for  relatively  basic  objective  functions,  the 
implementation  details  of  adjoint  system  sources  and  surface-parameter  relationships  can  become  quite 
involved,  and  extensive  testing  is  required  to  ensure  that  what  is  calculated  is,  in  fact,  the  desired 
derivative.  However,  this  is  where  the  advantages  appear  of  merging  optimization  with  the  case  based 
reasoning  system,  because  the  implementation  details  specific  to  each  device  can  be  worked  out  before 
hand  and  stored  with  the  device  in  the  case  based  reasoning  database.  Therefore,  the  case  based 
reasoning  system  provides  basically  everything  required  for  optimization:  the  parameterized  device 
template,  the  initial  parameter  values,  the  expressions  for  the  adjoint  system  sources,  the  partial 
derivatives  of  the  surface  position  with  respect  to  the  design  parameters.  The  result  is  that  the 
optimization  process  becomes  entirely  automatic. 


A  learning  System 

The  sensitivity  information  and  optimization  process  described  above  can,  in  fact,  be  used  in 
a  stand  alone  mode  to  create  a  device  directly  from  the  specifications  [5],  This  is  a  time  consuming 
process  and  is  likely  only  to  be  used  when  no  previous  design  experience  is  available.  However,  once 
a  new  device  has  been  designed,  either  directly  from  the  specifications  or  by  the  modification  of  a 
previous  case(s),  it  can  be  added  to  the  case  database.  This  expands  the  range  of  experience  and 
knowledge  stored  in  the  database  and,  with  time,  increases  the  capability  and  speed  of  the  system.  In 
effect,  the  system  learns  with  each  new  design  completed. 


Conclusions 

This  paper  has  described  a  complete  device  synthesis  and  optimization  process  which  combines  the 
speed  of  a  case  based  reasoning  system  to  provide  good  starting  point  for  the  optimization  process.  The 
combination  of  case  based  reasoning  with  sensitivity  analysis  provides  a  mechanism  by  which  designs 
can  be  created  in  the  shortest  possible  time.  The  system  can  operate  either  in  a  stand  alone  mode  or, 
more  frequently,  with  a  designer  to  guide  the  choice  of  a  final  design. 


Acknowledgments 

The  authors  would  like  to  acknowledge  the  support  of  the  Natural  Science  and  Engineering  Research 
Council  of  Canada  during  this  work. 


364 


References 

[1]  S.Russenchuck,  "Deterministic  Optimization  Strategies  for  the  Design  of  Electromagnetic 
Devices,"  Proceedings  of  the  6th  International  IGTE  Symposium  on  Numerical  Field 
Calculation  in  Electrical  Engineering,  September  26-28,  Graz,  Austria,  pp.  233-244. 

[2]  G.F.Uler,  O.A.Mohammed,  C.S.Koh,  "Design  Optimization  of  Electromagnetic  Devices  Using 
Genetic  Algorithms,"  Accepted  for  publication  in  IEEE  Transactions  on  Magnetics,  Vol.  30, 
November  1994. 

[3]  J.Simkin,  C.W.Trowbridge,  "Optimizing  Electromagnetic  Devices  Combining  Direct  Search 
Methods  with  Simulated  Annealing,"  IEEE  Transactions  on  Magnetics,  Vol.  28,  No.  2,  1992, 
pp.  1545-1548. 

[4]  A.Gottvald,  K.Preis,  C.Magele,  O.Biro,  A.Savini,  "Global  Optimization  Methods  for 
Computational  Electromagnetics,"  IEEE  Transactions  on  Magnetics,  Vol.  28,  No.  2,  1992,  pp. 
1537-1540. 

[5]  D.N.Dyck  and  D.A.Lowther,  "The  Optimization  of  Electromagnetic  Devices  Using  Sensitivity 
Information,"  Proceedings  of  the  6th  International  IGTE  Symposium  on  Numerical  Field 
Calculation  in  Electrical  Engineering,  September  26-28,  Graz,  Austria,  pp.  251-256. 

[6]  J.Kolodner,  P.Simpson,  E.Sycara,  "A  Process  Model  of  Case-Based  Reasoning,"  Proceedings 
of  the  9th  International  Joint  Conference  on  Artificial  Intelligence,  Los  Angeles,  CA,  pp.  284- 
290. 

[7]  P.A.Devijver,  J.Kittler,  Pattern  Recognition:  A  Statistical  Approach,  Prentice-Hall,  1982. 

[8]  R.Rong  and  D.A.Lowther,  "The  Application  of  Case-Based  Reasoning  to  the  Initial  Design  of 
Electromagnetic  Devices,"  Proceedings  of  the  International  Symposium  on  Advanced 
Computational  and  Design  Techniques  in  Applied  Electromagnetic  Systems,  June  22-24,  1994, 
Seoul,  South  Korea,  to  be  published  in  Elsevier  Studies  in  Applies  Electromagnetics  in 
Materials. 

[9]  C.M.Saldanha,  "An  Algebraic  Constraint  System  for  CAD  in  Magnetics,"  M.Eng.  Thesis, 
McGill  University,  1988. 

[10]  I.Park,  B.T.Lee,  S.Y.Hahn,  "Sensitivity  Analysis  based  on  Analytic  Approach  for  Shape 
Optimization  of  Electromagnetic  Devices:  Interface  Problem  of  Iron  and  Air,"  IEEE 
Transactions  on  Magnetics,  Vol.  27,  No.  5,  1991. 

[11]  D.N.Dyck,  D.A.Lowther,  E.M.Freeman,  "A  Method  of  Computing  the  Sensitivity  of 
Electromagnetic  Quantities  to  Changes  in  Materials  and  Sources,"  IEEE  Transactions  on 
Magnetics,  Vol.  30,  No.  5,  1994,  pp.  3415-3418. 


365 


A  COMPUTER  PROGRAM  FOR  THE  DESIGN  OF 
SUPERCONDUCTING  ACCELERATOR  MAGNETS 

S.  Russenschuck, 

CERN,  1211  Geneva  23,  Switzerland 


CERN  is  preparing  for  the  construction  of  the  Large  Hadron  Collider  (LHC),  to  be  installed  in  the  exist¬ 
ing  LEP  tunnel.  The  LHC  requires  superconducting  magnets  which  are  characterized  by  the  dominance  of 
the  coil  geometry  for  the  field  distribution.  The  program  package  ROXIE  has  been  developed  for  the  design 
and  optimization  of  the  coil  geometries  in  two  and  three  dimensions.  The  paper  describes  the  features  of  the 
program,  the  mathematical  optimization  techniques  applied  and  gives  examples  of  the  recent  design  work 
carried  out. 

INTRODUCTION 

The  Large  Hadron  Collider  (LHC)  project  is  a  superconducting  accelerator  for  protons,  heavy  ions  and 
electron-  proton  collisions  in  the  multi-TeV  energy  range  to  be  installed  at  CERN  in  the  existing  LEP  tunnel 
with  a  circumference  of  about  27  km.  The  new  facility  will  mainly  consist  of  a  ring  of  high  field  supercon¬ 
ducting  magnets  cooled  to  1.9  K  with  superfluid  helium  [21].  The  main  dipole  magnets  will  operate  at  about 
8.4  T  and  the  quadrupoles  at  220  T/m  field  gradient.  The  superconducting  magnets  are  characterized  by 
the  dominance  of  the  coil  geometry  for  the  field  distribution.  Therefore,  the  design  computations  start  by 
optimizing  the  coil  geometry  in  two  and  three  dimensions,  using  analytical  integration  for  computing  the 
magnetic  field.  Contradictory  parameters  such  as  maximum  dipole  field,  minimum  content  of  unwanted  mul¬ 
tipoles  and  sufficient  safety  margin  for  the  conductor  must  be  optimized.  The  keystoning  of  the  conductors 
and  the  grading  of  the  current  densities  between  the  two  layers  complicate  the  task  of  finding  the  best  solu¬ 
tion.  The  report  describes  the  ROXIE  Fortran  program  (Routine  for  the  Optimization  of  coil  X-sections, 
Inverse  field  computation  and  coil  End  design)  which  has  been  developed  for  the  coil  optimization  and  which 
is  available  for  general  use  on  CERNs  RiskbOOO  workstation  cluster  PaRC. 


It  is  in  the  coil  end  regions  of  superconducting  magnets  where  most  of  the  quenches  occur.  Therefore 
special  attention  is  paid  to  the  design  of  the  coil  ends  with  the  objectives  of  a  low  peak  field  and  a  low 
integrated  multipole  content.  A  3D  option  in  ROXIE  allows  the  geometric  position  of  the  cables  and  the 
field  in  the  end  region  to  be  calculated  with  a  high  accuracy.  A  further  area  of  application  for  ROXIE 
is  the  need  to  trace  back  the  origins  of  measured  field  imperfections.  The  mechanical  dimensions  of  the 
active  parts  of  the  coils  are  impossible  to  verify  under  their  operational  conditions,  after  their  deformation 
due  to  manufacture,  warm  pre-stressing,  cool-down  and  excitation.  The  inverse  problem  solving  consists  of 
using  optimization  routines  to  find  distorted  coil  geometries  which  produce  exactly  the  multipole  content 
measured.  For  the  solving  of  the  optimization  problems  and  the  inverse  field  calculation  a  number  of  math¬ 
ematical  programming  techniques  have  been  implemented.  They  will  be  described  in  this  paper. 

FEATURES  OF  THE  ROXIE  PROGRAM 

The  development  of  the  program  was  driven  by  three  main  objectives:  To  write  an  easy-to-use  program 
for  the  design  of  superconducting  coils  considering  field  quality,  quench  margin  and  persistent  current  mul¬ 
tipoles.  to  include  the  program  into  a  mathematical  optimization  environment  and  to  develop  an  integrated 
design  tool  with  sophisticated  graphic  routines,  interfaces  to  CAD  systems,  numerical  field  calculation  codes 
and  a  5  axis  milling  machine. 

The  program  includes  routines  to  define  geometrically  coil  cross  sections  and  coil  ends  of  dipoles, 
quadrupoles,  sextupoles,  octupoles  and  decapoles  made  of  Rutherford  type  superconducting  cables.  The 
geometric  position  of  coil  block  arrangements  in  the  cross  section  of  the  magnets  is  calculated  from  given 
input  data  such  as  the  number  of  conductors  per  block,  conductor  type  (to  be  specified  in  a  separate  input 
file),  radius  of  the  winding  mandrel  and  the  positioning  and  inclination  angle  of  the  blocks,  fig.  1.  The 
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fact  that  the  keystoning  of  the  cables  is  not  sufficient  to  allow  their  edges  to  be  positioned  on  the  curvature 
of  a  circle,  is  fully  respected.  This  effect  increases  with  the  inclination  of  the  coil  blocks  versus  the  radial 
direction.  Rectangular  shaped  coil  blocks  are  also  possible  if  the  cable  is  not  key  stoned.  Cold  conditions 
can  be  simulated  by  means  of  a  contraction  coefficient.  Furthermore  the  coil  can  be  subjected  to  elliptic 
deformation.  The  input  parameters  for  the  coil  end  generation  are  the  z  position  of  the  first  conductor  of 
each  coil  block,  its  inclination  angle,  the  straight  section  and  the  size  of  the  wedges  between  the  conductors, 
fig.  5.  Four  options  are  available:  Coil  end  design  with  shims  between  the  turns  and  conductors  placed  on 
the  winding  mandrel,  fig.  5,  coil  end  with  grouped  conductors  aligned  at  the  outer  radius  of  the  endspacers, 
fig.  6,  coil  end  for  magnets  with  rectangular  cross  sections,  fig.  13,  and  racetrack  coil  ends.  It  is  assumed 
that  the  upper  edges  of  the  conductors  follow  ellipses  or  circles  in  the  developed  sz  plane  defined  by  their 
radial  position  in  the  straight  subsection  and  the  z  position  in  the  yz  plane,  fig.  7.  The  curvature  of  the 
lower  edge  of  the  conductor  is  then  calculated  assuming  constant  perimeter.  The  geometric  positions  of  bare 
conductors  can  be  printed  in  different  formats  suitable  for  other  numerical  field  calculation  packages  such  as 
ANSYS  and  POISSON.  In  addition  the  form  of  the  end  spacers  is  constructed  and  an  option  allows  to  print 
polylines  which  give  the  input  to  a  5  axis  milling  machine. 

The  field  calculation  part  allows  the  calculation  of  magnetic  field,  Lorentz  -  forces,  field  quality  (multipole 
content)  and  quench  margin.  Each  conductor  is  approximated  by  line  currents  carrying  the  same  current 
located  at  the  strand  position  inside  the  conductor.  The  grading  of  the  current  due  to  keystoning  of  the 
cable  is  therefore  considered.  By  applying  Biot-Savart’s  law  for  each  line  current,  the  field  is  calculated  at  a 
given  radius  in  the  aperture  and  the  peak  field  in  the  coil  itself  is  evaluated.  Thus  it  is  possible  to  calculate 
the  multipole  content  and  the  peak  field  in  the  coils  with  a  higher  accuracy  than  with  finite-element  calcu¬ 
lations  (this  is  useful,  as  the  multipoles  to  be  minimized  create  fields  of  the  order  of  10“^T  at  the  nominal 
radius).  The  influence  of  the  iron  yoke  can  be  taken  into  account  by  imaging  the  line  currents  thus  assuming 
that  the  yoke  iron  has  an  infinite  permeability.  It  is  foreseen  to  apply  the  volume  integral  method  for  the 
consideration  of  the  iron  magnetization.  For  the  peak  field  calculation  the  influence  of  the  self  field  of  each 
strand  is  neglected.  As  the  critical  current  density  has  been  measured  for  a  strand  without  compensation 
for  the  self  field  the  correct  load  line  characteristic  curves  are  obtained  when  the  self  field  is  neglected  in 
the  calculations.  An  option  allows,  however,  to  take  the  self  field  into  consideration.  The  self  inductance  of 
the  coil  is  investigated  by  calculating  the  vector-potential  in  the  coil  and  the  flux  linkage  is  then  evaluated 
by  means  of  Stokes  theorem,  neglecting  the  inner  inductance  of  the  strands.  From  the  given  self  inductance 
the  stored  energy  is  calculated.  There  is  an  option  for  the  automatic  calculation  of  the  mutual  inductances 
between  inner  and  outer  layer  blocks  in  both  the  upper  and  lower  coil.  During  ramp  of  the  superconducting 
LHC  magnets  from  0.58  to  8.4  T  nominal  dipole  field  additional  currents  (persistent  currents)  are  induced 
in  the  filaments.  These  currents  which  consist  of  surface  currents  and  currents  in  the  bulk  of  the  super¬ 
conductor  persist  for  long  periods  as  they  decay  only  through  flux  creep.  The  influence  of  these  currents 
on  the  field  quality  can  be  evaluated.  For  the  3D  field  calculation  each  conductor  is  sliced  up  into  ’’bricks” 
containing  line  currents  the  same  way  as  in  the  2  dimensional  case.  The  minimum  radius  of  curvature  in 
the  bricks  is  then  approximated  by  means  of  central  differential  quotients.  The  variation  of  the  field  on  the 
position  of  a  line  current  (roller  coaster  track)  can  be  plotted  versus  the  number  of  the  brick.  In  addition 
the  integrated  multipole  content  and  the  field  along  a  line  defined  by  their  3  start  and  3  end  coordinates 
can  be  calculated  and  plotted.  Another  feature  is  the  calculation  of  the  cross  talk  from  one  coil  end  to  the 
other,  i.e.  the  integrated  multipole  content  in  the  end  of  aperture  2  is  investigated  which  is  excited  by  the 
current  in  aperture  1.  For  the  integrated  multipole  content  the  iron  yoke  can  be  considered  by  means  of 
image  currents  whereas  for  3D  peak  field  calculations  the  influence  of  the  iron  yoke  has  to  be  neglected  as 
in  this  case  the  imaging  method  gives  incorrect  results.  Asymmetric  coil  ends  can  also  be  evaluated;  this  is 
necessary  for  the  connection  end  of  the  magnets  and  for  the  calculation  of  effects  coming  from  asymmetric 
coil  block  displacements  in  the  ends.  A  particular  effect  which  can  be  studied  is  a  different  length  of  the 
upper  and  lower  pole  of  the  dipole  magnets. 

From  the  beginning,  the  program  has  been  structured  to  allow  the  application  of  mathematical  opti¬ 
mization  techniques.  It  has  been  put  into  an  environment  of  mathematical  optimization,  with  numerous 
methods  available  for  decision  making,  treatment  of  nonlinear  constraints  and  optimization  algorithms.  All 
input  data  can  be  addressed  as  design  variables  of  the  optimization  problem  or  as  an  objective  (e.g.  when 
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a  certain  geometrical  dimension  has  to  be  minimized).  All  the  evaluated  field  quantities  can  be  addressed 
as  objectives  for  the  optimization.  Decision  making  methods  such  as  objective  weighting,  distance  function, 
constraint  formulation  or  the  automatic  set  up  of  payoff  tables  are  implemented.  Nonlinear  constraints  can  be 
treated  either  by  a  feasible  domain  method,  the  penalty  transformation,  the  augmented  Lagrangian  function 
or  the  boundary  search  along  active  constraints.  Optimization  algorithms  such  as  EXTREM,  Rosenbrock, 
Levenberg-Marquard,  Davidon-Fletcher-Powell  and  genetic  algorithms  are  implemented.  In  addition  there 
is  a  sensitivity  analysis  with  Lagrange  multiplier  estimation  available  in  order  to  find  the  hidden  resources 
of  a  particular  design. 

Sophisticated  graphic  routines  have  been  developed  for  display  of  geometry,  field,  forces  etc.  both  in  2D 
and  3D.  The  options  for  3D  contain  hidden  surface  removal,  and  depth  sort  algorithm  subroutines.  The 
program  uses  only  a  few  primitives  from  the  GKS  library  and  can  therefore  easily  be  modified  for  other 
platforms.  ROXIE  can  plot  the  geometry  of  the  coil  in  2  and  3D,  the  copper  wedges  between  coil  blocks 
and  the  end  spacers  for  the  coil  end  region.  Displacement  vectors  for  coil  positions  in  the  distorted  case  can 
be  plotted,  fig.  4,  as  well  as  short  circuit  currents  detected  by  inverse  field  calculation  from  time  dependant 
multipoles.  Optional  is  a  heading  with  comments,  time  and  date.  The  following  field  quantities  can  be  dis¬ 
played  in  the  coil  cross  section;  Magnetic  field  and  Lorentz  force  components  in  cartesian  or  local  conductor 
coordinate  systems.  Vector  potential  A,  current  on  strand  position,  current  density  in  the  element,  in  the 
superconductor  and  in  the  copper,  and  magnetization  vector  due  to  persistent  currents.  In  3D  the  graphic 
routines  include;  Field  along  a  '’roller  coaster”  track  in  the  conductor,  3D  magnetic  field  in  the  aperture 
represented  by  cones,  fig.  11,  minimum  and  maximum  local  field  in  the  coils,  Lorentz  forces  normal  to 
the  cable  surfaces,  fig.  14,  Lorentz  forces  (cartesian  vectors)  acting  on  each  cable,  and  minimum  radius  of 
curvature  of  the  conductor. 

Fields  of  application  are  the  calculation  and  optimization  of  symmetrical  magnet  structures  with  su¬ 
perconducting  coils  made  of  Rutherford  type  conductors,  calculation  of  effects  coming  from  asymmetries 
(geometric  errors  in  the  coils  due  to  manufacturing  process)  both  in  2D  and  3D  and  inverse  field  problem 
solving  for  tracing  back  these  effects  from  field  measurements.  Further  applications  are  combined  function 
magnets  of  various  kinds,  asymmetries  in  the  ramp  and  splice  region,  cross-talk  between  the  apertures  in 
the  two-in-one  type  magnets,  design  and  optimization  of  the  coil  end  region.  The  features  shall  be  described 
in  more  detail  with  an  example  from  each  of  the  main  fields  of  application. 


Fig.  1:  Optimized  dipole  coil  with  5  block  structure  Fig.  2:  Load  line  characteristic  curves  for  the  5  block 
(MBPN  and  MBSMSO  models).  Display  of  the  mag-  dipole  coil,  as  displayed  in  fig.l.  Block  2  outer  layer, 
netic  field  modulus.  block  5  inner  layer. 
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Fig.  3;  Cross  section  of  combined  dipole-sextupole  Fig.  4:  Displacement  vectors  for  inverse  field  problem 
corrector,  with  field  vector  display  and  |  B  |  in  coils  solving  for  the  main  dipole  model  magnet  MTA. 
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Fig.  5:  Cut  of  main  quadrupole  end  with  conductors  Fig.  6:  Cut  of  model  dipole  end  MBSMSO  with 
placed  on  the  winding  mandrel.  grouped  conductors  aligned  on  the  outer  radius. 


Fig.  7:  Developed  view  on  conductors  in  the  dipole 

end  with  grouped  conductors,  outer  layer,  right:  up-  Fig.  8:  Polygons  for  endspacer  machining  with  5  axis 
per  edge,  left:  lower  edge.  milling  machine  for  dipole  model  magnet. 
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Fig.  9:  3D  representation  of  endspacers  for  dipole 
model  with  grouped  conductors  (MBSMSO  model), 


Fig.  11:  3D  representation  of  coil  end  of  the  MBTRA 
dipole  model  magnet  with  magnetic  field  cones. 


Fig.  10:  3D  representation  of  coil  end  of  the  MBFISC 
dipole  model  magnet  with  coordinate  system,  inner 
layer  only 


Fig.  12:  3D  representation  of  coil  end  of  the  insertion 
quadrupole 


Fig.  13:  3D  representation  of  coil  end  for  a  insertion  Fig.  14:  3D  view  of  sextupole  corrector  coil,  with 
quadrupole  design  with  rectangular  coil  cross  section,  pressures  due  to  Lorentz-forces  on  cables. 
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MATHEMATICAL  OPTIMIZATION  TECHNIQUES 


Mathematical  optimization  including  numerical  techniques  such  as  linear  and  nonlinear  programming, 
integer  programming,  network  flow  theory,  and  dynamic  optimization  has  its  origin  in  operations  research  de¬ 
veloped  in  response  to  military  needs  of  world  war  II.  An  optimality  criterion  for  optimization  problems  with 
conflicting  objectives  has  been  introduced  by  Pareto  in  1896  [20].  The  solution  process  for  vector-optimization 
problems  is  threefold,  based  on  decision  making  methods,  methods  to  treat  nonlinear  constraints  and  op¬ 
timization  algorithms  to  minimize  the  objective  function  (we  shall  call  this  an  optimization  procedure). 
Methods  for  decision  making  have  been  introduced  and  applied  on  a  wide  range  of  problems  in  economics  by 
e.g.  Marglin  1966  [16]  and  Fandel  1972  [3].  The  theory  of  nonlinear  programming  with  constraints  is  based 
on  the  optimality  criterion  by  Kuhn  and  Tucker,  1951  [14]  which  provided  the  basis  for  later  developments 
in  mathematical  programming.  Numerous  optimization  algorithms  both  using  deterministic  and  stochastic 
elements  have  been  developed  in  the  sixties.  Researchers  tend  to  come  back  to  genetic  and  evolutionary  al¬ 
gorithms  recently  as  they  are  suited  for  parallel  processing,  Fogel  1994  [6],  Holland  1992  [10].  While  desktop 
computing  power  to  date  is  available  which  was  ten  years  ago  only  accessible  in  large  computer  centers,  the 
combination  of  mathematical  optimization  techniques  and  field  calculation  has  become  a  modern  tool  for 
electrical  engineering. 

Characteristic  for  real-world  optimization  problems  are  multiple  conflicting  objectives  which  should  be 
considered  simultaneously  Usually  the  individual  solutions  for  each  single  objective  function  differ  and  a 
solution  that  maximizes  one  objective  will  not,  in  general,  maximize  any  of  the  other  objectives.  Any  im¬ 
provement  of  one  objective  can  be  achieved  only  at  the  expense  of  another.  For  the  definition  of  the  optimal 
solution  of  these  so  called  vector-optimization  problems  we  apply  the  optimality  criterion  by  Pareto  1896 
originally  introduced  for  problems  in  economics.  A  Pareto- optimal  solution  (for  vector  maximum  problems 
also  called  noninferior  solution)  is  given,  if  there  exists  no  other  solution  that  wdll  yield  an  improvement  in 
one  objective  without  degradation  in  at  least  one  other  objective.  It  is  clear  that  this  yields  a  set  of  solutions 
rather  than  one  unique  solution.  The  decision  maker  (who  is  not  always  the  design  engineer  undertaking  the 
optimization)  usually  picks  a  solution  out  of  a  solution  set  intuitively.  Applying  mathematical  optimization 
routines,  however,  requires  a  formalized  decision  making  method.  Below  some  methods  of  decision  making 
are  briefly  described.  It  can  be  proved  that  they  possess  a  substitute  problem  to  the  vector  optimization 
problem  with  a  Pareto- optimal  solution  (assuming  convexity  of  the  objective  functions).  A  comprehensive 
overview  on  decision  making  can  be  found  in  Cohon,  1978  [2]. 

The  objective  weighting  function  is  the  sum  of  the  weighted  objectives  fk{X)  and  results  in  the 
minimization  problem:  min{u{F[X))  =  U  ■  h{^))  with  the  weighting  factors  tk  representing  the 

users  preference.  The  design  variables  are  comprised  in  the  vector  X  .  The  problem  is  to  find  the  appro¬ 
priate  weighting  factors  in  particular  when  the  objectives  have  different  numerical  values  and  sensitivity. 
Using  objective  weighting  results  therefore  in  an  iterative  solution  process  where  a  number  of  optimiza¬ 
tions  have  to  be  performed  with  updated  weighting  factors,  a  cumbersome  process  if  the  decision  maker  is 
not  the  analyst  undertaking  the  calculations.  The  problem  of  choosing  the  weighting  factors  appropriately 
also  occurs  when  the  distance  function  method  is  applied.  Most  common  is  a  least  squares  objec¬ 
tive  function.  With  being  the  requirements  for  the  optimum  design,  the  minimization  problem  reads: 
min  |[  z{X)  |||  =  min  Ef=i  “  fkW)^-  The  disadvantage  of  least  squares  objective  functions 

with  the  euclidean  norm  [j  .  j{2  is  the  low  sensitivity  for  residuals  smaller  than  one.  Therefore  sufficiently 
high  weighting  factors  tk  have  to  be  introduced.  If  the  absolute  value  norm  is  applied  the  disadvantage  is 
the  nondifferentiable  objective  function  in  the  optimum  which  makes  the  application  of  Newton-methods 
impossible.  The  problem  with  the  weighting  factors  can  be  overcome  by  defining  the  problem  in  the  con¬ 
straint  formulation.  Only  one  of  the  objectives  is  minimized  and  the  others  are  considered  by  constraints. 
The  constraints  represent  the  minimum  request  values  specified  by  the  user  for  the  particular  objective. 
The  problem  is  that  for  badly  chosen  constraints  there  exists  no  feasible  solution  at  all,  whereas  non  active 
constraints  represents  a  zero  weight  for  the  particular  objective.  However,  starting  with  a  design  where  some 
engineering  expertise  was  already  put  into,  the  analyst  should  be  able  to  define  appropriate  constraints.  The 
constraint  formulation  has  the  advantage  that  a  sensitivity  analysis  by  means  of  a  Lagrange  multiplier 
estimation  can  be  performed  using  the  necessary  optimality  conditions  for  constraint  optimization  problems 
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introduced  by  Kuhn  and  Tucker.  The  interesting  characteristic  of  the  Lagrange  muitipliers  of  the  corre¬ 
sponding  Lagrange  function  to  the  constraint  problem  is  that  they  are  a  measure  of  the  price  which  has  to 
be  paid  when  the  constraint  is  decreased.  The  decision  maker  can  then  explore  the  hidden  resources  of  a 
design.  A  tool  which  provides  the  decision  maker  with  a  lot  of  information  about  the  hidden  resources  of  a 
design  is  the  payoff  table.  To  create  this  table  K  individual  constraint  optimization  problems  are  solved 
to  find  the  best  solution  for  each  of  the  K  objectives.  Best  compromise  solutions  can  then  be  found  by 
minimizing  the  ’’distance”  from  the  in  general  nonfeasible  ’’perfect  solution”  applying  different  norms  e.g. 
L\ .  Lo  and  Loo  * 

Decision  making  is  the  most  important  part  and  influences  the  convergence  behavior  of  the  whole  opti¬ 
mization  procedure.  For  the  minimization  of  constrained  and  unconstrained  objective  functions  numerous 
methods  have  been  proposed  in  the  last  30  years.  It  is  most  important  to  find  the  right  minimization  method 
to  fit  with  the  method  of  decision  making. 

As  the  design  variables  of  the  optimization  problem  can  usually  only  be  varied  between  upper^and 
lower  bounds,  a  modified  objective  function  is  applied  p{X)  =  /(A)  if  no  bound  is  violated  and  p(A'’)  = 
/(A“)  r{X)  if  a  bound  is  violated.  A"  =  (xi,  X2,  ...Xn)  and  r[X)  =  (x;  —  x;u)“  =  ^lu  if  >  ^lu 

,  r{X)  =  {xii  —  xi)^  X*  =  xii  if  X/  <  Xij.  n  is  a  sufficiently  high  penalty  parameter.  The  advantage  of  this 
procedure  is  that  the  violation  of  the  bounds  are  checked  before  a  function  evaluation  is  carried  out  and  that 
the  existing  algorithms  for  unconstrained  minimization  can  be  applied  without  modifications.  The  objective 
weighting  function  as  well  as  the  distance  function  method  allows  the  immediate  application  of  an  algorithm 
for  finding  the  minimum  value  of  an  unconstrained  objective  function  as  there  are  no  nonlinear  constraints 
to  be  considered.  With  the  constraint  formulation  the  problem  of  the  treatment  of  the  nonlinear  constraints 
arises.  One  method  is  the  transformation  of  the  constrained  problem  into  a  unconstrained  problem  by  means 
of  a  penalty  function  method.  The  main  idea  is  to  add  a  penalty  term  to  the  objective  function  which 
depends  on  the  degree  to  which  the  constraints  are  violated  and  vanishes  if  the  constraints  are  satisfied. 
The  constraint  optimization  problem  minf{X)  subject  to  equality  constraints  hj{X)  =  0  and  inequality 
constraints  gi{X)  <  0  transformed  into  the  penalty  function  to  be  minimized  reads: 

p{X)~f{X)  ~\-J2pi-max^{0.,gi{X)-Ci)  -h  (A)  -  (1) 

i  j 

In  order  to  achieve  feasibility  of  the  result  the  penalty  factors  pf,  pj  have  to  be  chosen  relatively  large. 
Large  penalty  factors,  however,  lead  to  ill-conditioned  function  topologies,  and  the  optimization  algorithms 
need  many  function  evaluations  to  find  the  minimum.  It  was  therefore  proposed  to  solve  a  sequence  of 
unconstrained  minimization  problems  with  increasing  penalty  factors,  SUMT  or  to  add  the  penalty  terms 
to  the  corresponding  Lagrange  function,  method  of  augmented  Lagrangian.  Replacing  the  squares  in 
(1)  by  the  absolute  value  leads  to  the  exact  penalty  transformation  where  the  penalty  factors  can  be 
reduced.  The  problem  here  is  the  nondifferentiability  of  the  objective  function  for  active  constaints.  If  the 
number  of  design  variables  does  not  exceed  5,  the  method  of  boundary  search  along  active  constraints 
can  efficiently  be  used.  The  ROXIE  user  has  the  choice  between  the  above  mentioned  methods.  It  needs 
some  experience  to  select  an  efficient  procedure  consisting  of  decision  making,  treatment  of  nonlinear  con¬ 
straints  and  a  suitable  optimization  algorithm. 

The  following  optimization  algorithms  are  available:  Optimization  algorithm  EXTREM  by  Jacob  [11] 
which  consists  of  one-dimensional  minimizations  by  means  of  Powell  extrapolations  [22]  in  a  main  search 
direction  (user  supplied)  and  a  orthogonal  direction  evaluated  by  Gram-Schmidt  orthogonalization.  After 
these  one-dimensional  searches  have  been  carried  out  (end  of  the  i-th  step)  the  main  search  direction  is  up¬ 
dated  by  a  vector  pointing  from  the  initial  outline  to  the  minimum  of  the  i-th  step.  The  user  has  to  supply 
an  initial  step-size  which  is  taken  to  (xfu  —  x/f)/10..  This  user-friendly  algorithm  is  suitable  for  practically  all 
applications.  The  parametric  study  evaluates  the  objective  functions  for  all  the  combinations  of  different 
design  variables  where  for  each  design  variable  10  different  values  are  taken.  As  the  total  number  of  function 
evaluations  is  therefore  10"  (for  n  design  variables)  the  parametric  study  is  only  appropriate  for  up  to  3 
design  variables.  The  sensitivity  analysis  calculates  the  sensitivity  due  to  errors  in  the  design  variables. 
A  Jacobian  Matrix  is  printed  out.  It  consists  therefore  of  n-fl  function  evaluations.  The  Levenberg- 
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Marquard  algorithm  [13]  was  originally  developed  for  nonlinear  regression  problems  using  least  squares 
objective  functions.  We  can  apply  it  therefore  efficiently  to  the  distance  function  problem.  To  check  the 
necessary  optimality  conditions  for  constraint  problems  (Kuhn-Tucker  equations)  a  Lagrange-Multiplier 
estimation  can  be  carried  out  minimizing  the  corresponding  Lagrange-  function.  The  Lagrange  function 
is  minimized  using  the  Davidon-Fletcher-Powell  algorithm  where  the  step-size  and  direction  is  given  by 
AX  =  —H~‘^Vf{X)  The  Hessian  Matrix  H  does  not  have  to  be  calculated  in  each  point  but  is  updated 
iteratively  beginning  with  the  unity  Matrix  I.  As  the  ”  design  variables”  are  the  Langrange  -  multipliers  a,  /? 
to  be  estimated,  the  derivatives  in  Oi,  /?ican  be  approximated  with  a  high  accuracy.  That  is  the  reason  for 
the  good  convergence  behavior  of  this  method  when  applied  to  the  Lagrange  multiplier  estimation.  It  can, 
however,  be  applied  also  to  all  the  other  design  optimization  problems,  the  convergence  behavior  is  then 
dependent  upon  the  function  topology.  In  genetic  algorithms,  each  trial  solution  is  coded  as  a  vector 
(chromosome)  X  with  elements  being  described  as  genes.  Holland  [10]  suggested  that  the  chromosomes  are 
represented  by  binary  strings.  It  is  obvious  that  this  algorithm  is  specially  suited  for  integer  programming. 
From  two  sets  of  chromosomes  offspring  are  produced  by  use  of  genetic  operators  such  as  crossover  and 
mutation.  Crossover  means  the  interchanging  of  sections  of  the  chromosomes  between  two  parent  chro¬ 
mosomes.  The  position  where  the  chromosomes  are  split  into  two  sections  is  chosen  randomly.  Mutation 
means  that  with  a  certain  likelihood  single  bits  in  the  chromosomes  are  flipped.  As  an  example,  let  be  the 
parent  chromosomes  OOOlOOllOjlOlOlll  and  110001010|  1100011  ,  then  the  offspring  is  created  by  crossover 
(I  indicates  the  crossover  point)  000100110|  1100011,  110001010|  10101 11.  In  order  to  avoid  the  dominance 
of  "superindividuals”  which  leads  to  an  uniform  population  unable  of  further  evolution,  a  sufficiently  large 
gene  pool  has  to  be  chosen  from  which  offspring  are  created  by  mating  pairs  of  chromosomes.  The  likelihood 
of  reproduction  for  each  chromosome  is  then  proportional  to  the  objective  function  value  for  the  particular 
design  represented  by  the  chromosome.  From  the  offspring  the  fittest  strings  are  used  for  the  next  genera¬ 
tion.  Only  about  60%  of  the  population  is  used  for  crossover  (crossover  probability  0.6).  The  others  simply 
survive  and  ensure  the  genetic  diversity  of  the  population.  The  bit  mutation  probability  is  0.001  which 
means  that  only  every  1000  th  bit  is  mutated.  The  problem  of  genetic  algorithms  is  the  dependance  of  the 
solution  on  user  supplied  and  problem  dependent  input  such  as  crossover  probability,  size  of  the  gene  pool 
etc.  More  research  has  to  go  into  making  the  algorithm  more  robust  for  every  day  use  in  optimization  of 
superconducting  magnets. 

EXAMPLES 

Dipole  coil  optimization  (cross  section) 

Because  of  the  length  of  the  dipole  magnets  (about  13  m)  the  design  optimization  starts  with  the  2D 
cross-section  of  the  coils.  The  design  variables  for  this  optimization  problem  are  the  positioning  angles  and 
the  inclination  angles  of  the  coil  blocks  (8  or  10  design  variables  depending  on  the  number  of  blocks  chosen), 
c.f.  fig.  1. 

The  field  quality  of  a  magnet  is  usually  described  by  the  Bn  (called  the  normal)  and  the  (called  the 
skew  components)  of  the  field  defined  by  the  expansion  of  the  complex  field:  By  -l-iBx  = 
with  z  =  x  +  iy  =  =  r(cos^  -|-  isin^^)  and  By,  -H  iBr  -  {By  +  iBr)e^‘^  .  In  ROXIE  the  r-  component 

of  the  field  is  calculated  directly  by  means  of  the  Biot-Savarts  law.  Br{^)  at  a  given  radius  r  =  ro  is  then 
harmonically  analyzed  by  means  of  the  program  TRICOF  from  the  CERN  Library  which  has  been  set  up 
for  double  precision  for  higher  accuracy.  With  A  being  usually  defined  as  the  coefficients  of  the  cos  terms 
and  B  being  the  coefficients  of  the  sin  terms  we  get  from  given  equidistant  function  values  in  the  interval 
-TT  <  (p  <  tt:  Br{ro)  =  J]^_i(Bn(ro)  sin  -f*  A„(ro)  cos  n^)  =  Bo(ro)  53^=1  -h  a„  cos n^s).  The 

A  and  B  terms  are  in  agreement  with  the  above  definition  for  the  unit  radius.  The  optimization  problem 
possesses  a  number  of  different  objectives  which  are  partially  contradictory: 
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•  A  main  dipole  field  of  8.65  T 

•  A  small  sextupole  field  comp.  (63  =  min)  and  a  small  decapole  field  comp.  (65  =  min) 

•  67  <  0.02  •  10“^  and  69  <  0.005  •  lO"'^ 

•  Minimum  volume  of  superconductor 

•  Maximum  margin  to  quench 

•  Small  multipole  content  due  to  persistent  currents 

By  applying  the  objective  weighting  function  with  recursively  updated  weighting  factors  and  the  algo¬ 
rithm  EXTREM,  a  design  with  only  five  coil  blocks  could  be  found  which  perfectly  meets  the  requirements 
concerning  field  quality  as  given  from  beam  simulations.  All  previous  design  studies  considered  a  six  block 
design.  Numerical  results  are  given  in  Table  2  (MBPN  and  MBS  MS  0  models),  the  cross  section  is  displayed 
in  fig.  1.  The  deterministic  optimization  procedure  converges  very  well  and  minimizes  the  weighting  function 
usually  in  about  200  function  evaluations.  However,  for  each  run  the  number  of  blocks  and  the  number  of 
conductors  per  block  has  to  be  selected  and  kept  constant.  This  way,  the  optimization  routine  cannot  be 
used  to  discover  a  new  design  principle. 

An  approach  for  a  ’’creative”  design  tool  could  be  the  use  of  genetic  algorithms.  As  they  are  specially 
suited  for  integer  programming  a  optimization  problem  was  set  up  with  predefined  conductors  (48  in  the 
outer  layer,  26  in  the  inner  layer)  where  by  means  of  genetic  algorithms  the  current  was  switched  on  and  off, 
thus  resulting  in  bit  strings  (genes)  of  size  74.  The  size  of  the  gene  pool  was  65,  the  crossover  probability  0.6 
and  the  bit  mutation  rate  0.001.  As  in  this  case  the  number  of  conductors  is  not  fixed,  a  minimum  number 
of  turns  was  included  as  an  additional  objective  in  the  weighting  function.  Fig.  15  shows  some  intermediate 
steps  of  the  minimization  after  65,  195  and  4550  function  evaluations.  The  new  idea  which  could  be  derived 
is  that  by  piling  up  some  more  conductors  in  the  outer  layer  results  in  a  shielding  effect  and  the  pole  angle 
of  the  inner  layer  can  be  increased.  A  design  derived  from  this  idea  using  the  same  cable  as  for  the  5  block 
design  (fig.  1)  can  be  seen  in  fig.  15  on  the  right.  Both  designs,  although  quite  different,  use  the  same 
amount  of  superconductor,  have  both  the  same  high  field  quality  and  the  same  margin  to  quench  in  the 
inner  layer.  Advantageous  of  the  design  in  fig.  15  is  the  larger  pole  angle  in  the  inner  layer,  which  results  in 
easier  to  wind  coil  ends.  Disadvantages  are  the  higher  current  density  in  the  cable  necessary  to  produce  the 
same  main  field  (resulting  in  higher  hot  spot  temperatures  at  quench)  and  the  reduced  margin  to  quench  in 
the  outer  layer. 


Fig.  15:  Intermediate  steps  of  the  optimization  using  genetic  algorithms  after  65,  195  and  solution  after 
4550  function  evaluations  (from  left  to  right).  A  feasible  design  derived  from  the  solution  by  means  of  de¬ 
terministic  methods  using  the  same  cable  as  the  5  block  design  (fig.l)  is  displayed  on  the  right. 

The  main  quadrupole  as  an  inverse  field  problem 

With  a  slightly  modified  input  file  ROXIE  is  able  to  calculate  multipole  errors,  peak  field  etc.  in  asym¬ 
metric  coil  block  arrangements.  This  is  the  basis  of  the  inverse  field  calculation  option  for  the  tracing  back  of 
conductor  displacements  from  given  field  harmonic  measurements.  These  displacements  can  well  be  asym¬ 
metric  thus  requiring  individual  treatment  of  each  coil  block, 

The  main  parameters  of  the  lattice  quadrupoles  for  LHC  are  a  nominal  gradient  of  252  T/m,  a  mag¬ 
netic  length  of  3.05  m,  a  nominal  current  of  15060  A,  an  inner  coil  aperture  of  50  mm  and  an  operational 
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temperature  of  1.8  K.  Before  being  assembled  into  their  common  yoke  the  two  coil-collar  assemblies  of  the 
second  magnet  have  undergone  magnetic  measurements  at  room  temperature.  Table  1  gives  the  measured 
multipole  distribution  in  the  straight  part  of  one  of  these  assemblies  together  with  the  expected  (intrinsic) 
values. 


Measured 

Intrinsic 

n 

Normal 

Skew 

Normal 

Skew 

1 

- 

- 

- 

- 

2 

-21910. 

- 

-21910. 

- 

3 

1.972 

-5.456 

- 

- 

4 

-0.723 

-0.920 

- 

- 

5 

-0.219 

-0.436 

- 

- 

6 

-0.066 

0.153 

-0.79 

- 

7 

-0.004 

-0.285 

- 

- 

8 

-0,002 

0.109 

- 

- 

9 

-0.043 

-0.022 

- 

- 

10 

0.59 

-0.044 

0.52 

- 

Table  1 :  Measured  and  intrinsic  multipole  content  of 
the  warm  quadrupole  coils. 

Fig.  16:  Displacement  vectors  for  the  quadrupole 

The  function  to  be  minimized  in  the  inverse  field  computation  problem  is  a  least  squares  objective  function 
minz{X)  =  rnin  pi  •((/i(X))^-|-(pt(X))“)  with  the  residuals  fi{X)  =  b'^{X)  —  bi  and  gi{X)  =  a*{X)~ai 
where  (X)  are  the  calculated  and  bi,ai  are  the  measured  multipoles.  X  is  the  vector  of  the  design 

variables  for  the  inverse  problem.  The  p,-  are  weighing  factors  in  order  to  compensate  for  the  different  nu¬ 
merical  values  of  the  residuals.  The  design  variables  are  the  possible  perturbations  of  the  coil  blocks,  which 
determine  the  content  of  the  multipole  components.  Because  of  the  non-symmetric  nature  of  the  geometrical 
coil  positioning  errors  a  high  number  of  design  variables  results  for  the  inverse  field  problem.  It  had  therefore 
to  be  assumed  that  the  positioning  errors  hold  for  an  entire  coil  block  rather  than  individual  conductors. 
Constraints  on  the  block  displacements  were  introduced:  At  the  poles  the  collar  inserts  represent  a  limitation 
to  any  azimuthal  motion.  The  blocks  adjacent  to  the  horizontal  or  vertical  planes  move  together  azimuth  ally; 
this  motion  may  be  different  for  the  two  layers.  In  this  way  a  difference  of  elastic  moduli  in  the  coil  layers  is 
accounted  for.  Each  block  is  allowed  to  move  in  the  radial  direction  resulting  in  a  total  number  of  32  design 
variables  for  the  inverse  field  problem.  It  was  solved  by  means  of  the  Levenberg-Marquard  algorithm  with 
about  800  function  evaluations.  The  result  is  displayed  in  fig.  16.  The  displacements  show^  no  significant 
inside  movement  of  inner  layer  blocks.  The  adopted  collaring  procedure,  where  the  coil  assembly  mandrel  is 
extracted  before  the  final  compression  of  the  collars,  seems  therefore  acceptable. 

Coil  end  optimization 

The  design  variables  for  the  optimization  are  the  length  of  the  straight  section  of  each  block,  the  ellipticity 
of  the  innermost  turn  of  each  block  and  the  inclination  angle  of  each  of  the  blocks  in  the  yz  plane,  c.f.  fig.  6. 
The  positioning  angle  and  inclination  angle  for  defining  the  cross  section  (xy  plane)  (fig.  1)  of  the  magnet  was 
kept  constant  as  found  by  the  2D  cross  section  optimization.  The  objectives  for  the  coil  end  optimization  are: 

•  Minimum  integrated  multipole  content  in  the  coil  end 

•  Low  enhancement  of  the  field  in  the  coil 

•  Minimization  of  the  variation  of  pressure  on  the  narrow  face  of  the  conductor  due  to 
Lorentz  forces  in  order  to  avoid  slip  between  turns 

•  Maximization  of  the  minimum  radius  of  curvature  in  each  coil  block 

•  Short  physical  length  of  the  end 

A  short  end  length  and  homogenous  Lorentz  force  distribution  in  the  innermost  turn,  which  can  be 
achieved  by  placing  the  inner  block  of  the  outer  layer  closer  to  the  straight  section  of  the  magnet,  leads 
to  a  higher  peak  field  to  main  field  ratio.  The  problem  was  solved  with  objective  weighting  functions  and 
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successively  updated  weighting  factors  and  the  algorithm  EXTREM.  Table  2  shows  the  results  of  the  field 
calculations  for  different  versions  of  model  dipole  magnets.  2D  cross  section  optimizations  have  been  carried 
out  for  the  models  MBFISC,  MBTRA  and  MBPN,  but  only  the  MBPN  and  the  MBSMSO  coil  end  (which 
have  the  same  cross  section)  has  been  optimized  using  the  described  methods.  For  the  optimization  of  the 
integrated  multipole  content  the  iron  yoke  is  assumed  to  end  at  the  onset  of  the  coil  end  (z=0),  in  order  to 
reduce  the  peak  field  in  the  coil  end  region.  The  peak  field  is  related  to  the  dipole  field  which  would  occur 
for  the  bare  coil. 


Design 

MTA 

MBFISC 

MBTRA 

MBPN 

MBSMSO 

Aperture  Diameter  (mm) 

50. 

56. 

56. 

56. 

Cable  height,  inner  layer  (mm) 

17. 

16.7 

12.3 

15. 

Cable  height,  outer  layer  (mm) 

17. 

16.7 

11.7 

15. 

Number  of  turns 

74 

82 

98 

82 

Beam  distance  (mm) 

180. 

200. 

180. 

180. 

Yoke  radius  (mm) 

93. 

91.5 

82. 

98 

Length  of  coil  end  (mm) 

215. 

163. 

140. 

191. 

178. 

Cable  length,  Inner  layer  (mm) 

3533 

4473 

8091 

5985. 

4591. 

Cable  length,  Outer  layer  (mm) 

10229 

11658 

12830 

12752. 

11943. 

Nominal  current  (A) 

13987. 

12844. 

8718. 

11614. 

Field  at  nominal  current  (T) 

9.85 

9.46 

8.65 

8.65 

Quench  field  (T) 

10.37 

9.96 

9.43 

9.65 

Self  inductance  (mH/m) 

6.42 

7.82 

11.41 

’.64 

NI/B 

105063. 

111352. 

98773. 

110097. 

63(10'“'^  at  r=10  mm) 

0.690 

0.046 

0.15 

-0.345 

65(10“^  at  r=10  mm) 

0.110 

0.0015 

0.0300 

0.0244 

67(10“'^  at  r=10  mm) 

0.150 

-0.0088 

0.0173 

0.0060 

69/(10"'^  at  r=10  mm) 

0.020 

0.0022 

-0.0005 

0.0000 

723(10”^  at  r=10  mm) 

-4.72 

-3.09 

-3.62 

3.62 

715(10”^  at  r=10  mm) 

0.332 

0.111 

0.121 

0.150 

777(10”^  at  r— 10  mm) 

-0.019 

-0.019  : 

-0.027 

-0.027 

77g/(10“'^  at  r=10  mm) 

0.009 

0.004 

0.005 

0.006 

Peak  field/main  field,  inner  layer,  2D 

1.0235  : 

1.0327 

1.057 

1.051 

Peak  field/main  field,  outer  layer,  2D 

0.856  ' 

0.908 

0.892  1 

0.875 

Peak  field/main  field,  inner  layer,  3D 

1.0473 

1.0668 

1.107 

1.100 

1.18 

Peak  field/main  field,  outer  layer,  3D 

0.921 

0.992 

0.974 

0.923 

0.709 

AP  [Nlmm^) 

15.3 

10.6 

19.2 

18.6 

15.9 

Fz,  inner  layer  (N)  ' 

14460. 

10171. 

16827. 

13279. 

13940. 

Fz,  outer  layer  (N) 

29406. 

36897. 

25645.  ' 

31641. 

31419. 

EUipticity  ratio,  Inner  layer 

0.998 

0.963 

0.586 

0.864 

0,714 

EUipticity  ratio,  Outer  layer 

1.027 

0.939 

0.638 

0.636 

0.762 

Min.  radius  of  curvature,  inner  layer 

4.62 

14.36 

13.89 

6.24 

14.32 

Min.  radius  of  curvature,  outer  layer 

31.1 

15.78 

34.54 

37.24 

30.32 

at  r=10  mm) 

52.47 

22.55 

-33.07 

0.000 

2.05 

at  r=10  mm) 

-0.22 

0.189 

-1.68 

0.000 

-0.0065 

2)7*(10“^  at  r=10  mm) 

0.135 

-0.0036 

-0.077 

0.0049 

-0.0442 

ii|*(10“'^  at  r=10  mm) 

0,0215 

0.0069 

-0.0031 

0.0048 

-0.0071 

Table  2:  Results  for  the  2D  and  3D  coil  optimizations,  hn  normcJ  relative  multipoles  (2D),  Un  normal  relative  multi¬ 
poles  due  to  persistent  currents  at  injection  energy.  Integration  path  for  relative  integrated  multipole  errors  6“  Roni 
z=-100mm  to  z=250  mm.  Iron  yoke  ends  at  z— 0.  EUipticity  ratio  given  for  the  upper  edge  of  the  innermost  turn  in 
each  layer,  AP  =  Variation  of  pressure  due  to  Lorentz  forces  on  the  narrow  face  of  the  innermost  conductor  of  the 
inner  layer.  AP  and  F-  are  given  for  a  current  that  would  give  8.65  T  with  a  iron  yoke  of  85  mm. 

The  multipole  content  given  for  the  cross  section  shows  that  the  MBFISC, MBTRA  and  MBPN  dipole 
models  have  been  already  successfully  optimized  in  two  dimensions.  The  MBPN  and  MBSMSO  dipole  mod¬ 
els  show  significantly  reduced  integrated  multipole  content  in  the  ends.  The  peak  field  to  main  field  ratio 
is  difficult  to  reduce  since  for  the  cross  section  optimization  more  turns  are  added  in  the  second  block  of 
the  outer  layer.  This  is  done  in  order  to  reduce  the  6?  and  69  components,  but  considerably  increases  the 
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peak  field  in  the  coil  ends  which  cannot  be  compensated  with  the  end  design.  This  effect  is  particularly 
pronounced  in  the  MBTRA  model  which  uses  the  small  cable  of  only  about  12  mm  height  and  the  5  block 
design  in  the  cross  section.  The  reduction  of  the  variation  of  Lorentz  forces  acting  on  the  narrow  face  of 
the  conductor  which  constitutes  a  source  of  slip  between  the  conductors  is  very  difficult.  It  can  be  done  by 
shifting  the  inner  block  of  the  outer  layer  towards  the  center  of  the  magnet  as  it  is  done  for  the  MBFISC  and 
the  MBSMSO  design.  The  objective  conflicts  with  the  required  minimum  field  enhancement  in  the  ends. 
Therefore  a  dipole  model  will  be  built  with  a  reinforced  mechanical  construction,  the  so  called  end  clamping 
cage. 

ACKNOWLEDGEMENT 

I  thank  T.  Tortschanoff  and  N.  Siegel  for  the  discussions  and  ideas  which  have  been  incorporated  into 
the  program,  which  at  the  current  3.4  version  led  to  more  than  14000  lines  of  computer  code.  Many  thanks 
to  R.  Wolf  who  contributed  the  subroutines  for  the  calculation  of  the  persistent  current  effects.  I  would  like 
to  express  my  gratitude  to  S.  Masso  who  installed  the  program  on  CERN’s  parallel  RISC6000  workstation 
cluster  PaRC,  as  well  as  to  all  colleagues  and  users  of  ROXIE  who  helped  me  testing  the  versions  or  provided 
me  with  their  designs  which  I  could  use  as  examples. 


References 

[1]  Bertsekas,  D.P.:  Constrained  optimization  and  Lagrange  multiplier  methods,  Academic  Press,  1982 

[2]  Cohon,  J.L.:  Multiobjective  Programming  and  Planning,  Academic  Press,  New  York,  1978 

[3]  Fandel,  G.:  Optimale  Entscheidung  bei  mehrfacher  Zielsetzung,  Springer,  Berlin  1972 

[4]  Fiacco,  A.V.,  McCormick,  G.P.:  Sequential  unconstrained  minimization  techniques,  Wiley,  1968 

[5]  Fletcher,  R.,  Powell,  M.J.D.:  A  rapidly  convergent  descent  method  for  minimization.  Computer  Journal, 
Vol.  6,  1963 

[6]  Fogel  D.B.:  An  Introduction  to  Simulated  Evolutionary  Optimization,  lEEE-Transactions  on  Neural 
Networks,  1994. 

[7]  Fletcher,  R.,  Reeves,  C.M.:  Function  minimization  by  conjugate  gradients,  Computer  Journal,  Vol.  7, 
1964 

[8]  Gill,  P.E.,  Murray,  W.,  Wright,  M.H.:  Practical  optimization,  Academic  Press,  1981 

[9]  Himmelblau,  D.M.:  Applied  nonlinear  Programming,  McGraw-Hill,  1972 

[10]  Holland,  J.H.:  Genetic  algorithms.  Scientific  American,  1992 

[11]  Jacob,  H.G.:  Rechnergestiitzte  Optimierung  statischer  und  dynamischer  Systeme,  Springer,  1982. 

[12]  Krebs,  M.:  Vergleich  unterschiedlicher  Methoden  zur  Behandlung  nichtlinearer  Restriktionen  in  Opti- 
mierungsproblemen,  Diplomarbeit  am  Institut  fuer  Elektrische  Energiewandlung,  Technische  Hochschule 
Darmstadt,  Germany,  1990. 

[13]  Kuester,  J.,  Mize,  J.H.:  Optimization  techniques  with  Fortran,  Me  Graw-Hill,  1973 

[14]  Kuhn,  H.W.,  Tucker,  A.W.:  Nonlinear  Programming,  Proceedings  of  the  2nd  Berkeley  Symposium  on 
Mathematical  Statistics  and  Probability,  University  of  California,  Berkeley,  1951 

[15]  Luenberger,  D.G.:  Introduction  to  linear  and  nonlinear  programming,  Addison- Wesley,  1973 

[16]  Marglin,  S.A.:  Objectives  of  Water- Resource  Development  in  Maass,  A.  et  al.  Design  of  Water- Resource 
Systems,  Cambridge,  1966 

[17]  Marquard,  D.M.:  An  algorithm  for  least  squares  estimation  of  nonlinear  parameters,  Society  for  Indus¬ 
trial  and  Applied  Mathematics,  1963 

[18]  Morse  P.M.,  Kimball  G.E.:  Methods  of  operations  research,  Wiley  Sons,  New  York,  1950. 

[19]  Nelder,  J.A.,  Mead,  R.:  A  simplex  method  for  function  minimization.  Computer  Journal,  Vol.  7,  1964 

[20]  Pareto,  V.:  Cours  d’Economie  Politique,  Pouge  1896  or  translation  by  Schwier,  A.S.:  Manual  of 
Political  Economy,  The  Macmillan  Press,  1971 

[21]  Perm,  R.:  The  Superconducting  Magnet  System  for  the  LHC,  lEEE-Transactions  on  Magnetics,  1991 

[22]  Powell,  M.J.D.:  An  efficient  method  for  finding  the  minimum  of  a  function  of  several  variables  without 
calculating  derivatives.  Computer  Journal,  1964 


377 


Application  of  Optimization  to  the  Design  of  Electromechanical  Devices 

Sykulski,  J.  K.;  Cheng,  Y.  B. 

Department  of  Electrical  Engineering 

University  of  Southampton,  Southampton  SO  5NH,  UK 


Abstract 

The  paper  presents  an  automated  optimization  system  which  interfaces  an  external  optimizer  to  a  general  purpose  finite 
element  package.  The  program,  which  is  easy  to  use  and  user  friendly  with  several  pull-down  menus,  searches  for 
constrained  shape  design  variables  that  fulfil  the  defined  performance  criteria.  The  field  analysis  forms  part  of  the 
optimization  iterative  cycle.  The  idea  of  an  external  optimizer  is  very  attractive  as  it  provides  flexibility.  The 
optimization  process  can  be  interrupted  and  restarted  easily  without  loss  of  accumulated  information.  The  formulation 
of  the  Gauss-Newton  optimization  algorithm  with  Levenberg-Marquardt  modification  and  its  implementation  are  given 
in  some  detail.  Two  application  examples  are  described.  First,  a  d.c.  solenoid  actuator  with  truncated  cone  pole  face 
is  optimized  to  produce  a  user  specified  force-displacement  characteristic.  Secondly,  an  actuator  solenoid  is  optimized 
to  produce  maximum  energy  per  stroke.  The  ampere-turn  excitation  in  the  coil  is  governed  by  the  maximum  power  loss 
and  maximum  current  density  allowable  in  the  coil. 

Introduction 

The  control  valve  of  an  electrohydraulic  system  is  operated  by  a  solenoid  mounted  on  the  valve  block  (Fig.  1).  The 
magnetic  potential  difference  produces  an  attractive  force  betw’een  the  pole  face  and  the  armature,  which  moves  to  close 
the  airgap  and  minimize  its  magnetic  reluctance.  A  push  pin  extending  from  the  centre  of  the  armature  acts  directly 
on  the  spool. 


Fig.  1 .  The  electromechanical  actuator  (a)  and  a  typical  field  plot  (i) 

Accurate  field  modelling  of  a  d.c.  solenoid-operated  directional  control  valve  has  been  conducted  using  a  general 
purpose  finite  element  package  OPERA-2d  [27].  Magnetic  forces  acting  on  armature  at  different  displacements  have 
been  predicted  using  the  Virtual  Work  Principle  and  the  Maxwell  Stress  Tensor.  Computed  characteristics  correlate 
closely  with  experimental  results  [3].  The  solution  is  carried  out  in  axis-symmetric  mode  to  reflect  the  actual  symmetry 
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of  the  device.  The  non-linear  non-homogeneous  equation  is  solved  in  terms  of  magnetic  vector  potential  A 

Vxf-Vx^l  =  J-o~ 


where  J  is  the  imposed  current  density.  The  total  force  on  armature  may  be  computed  from  the  finite-element  solution 
using  the  Maxwell  Stress  Tensor  (MST)  by  performing  surface  integration  in  the  air  surrounding  the  armature.  Because 
of  axial  symmetry  this  reduces  to  carefully  selecting  appropriate  integration  lines  in  the  cross-section  of  the  device. 
Implementation  of  the  Virtual  Work  Principle  (VWP),  on  the  other  hand,  requires  at  least  two  solutions.  The  magnetic 
force  along  the  axial  (z)  direction  is  then  found  from 

„  3W'(.i,z)  (2) 


Instead  of  computing  the  force  from  the  coenergies  at  both  ends  of  an  incremental  displacement,  a  set  of  coenergies 
may  be  foimd  at  several  positions  along  the  armature  displacement.  A  least-square  polynomial  curve  is  then  fitted 
through  these  values  and  more  accurate  space  differentiation  is  applied  to  obtain  force/displacement  characteristic.  The 
coenergy  W’Ji.z)  for  nonlinear  axis-symmetric  solutions  is  given  by 

W'^  =  2f  —  J  27;r[ 


Automated  optimization  of  solenoid  actuators 

The  nonlinear  least-squares  problem 

Typically  m  data  values  (e.g.  the  desired  force  in  solenoid  actuator  optimization)  d, ,  d2,  ...  ,d^  are  given,  which  have 
been  sampled  for  values  tj  ,  t2  ,  ...  ,  of  some  independent  variable  t  (e.g.  the  armature  displacement).  It  is  then 
required  to  fit  a  function  (f>(t,  x)  which  has  n  design  variables  x  to  be  chosen  so  that  the  function  best  fits  the  data. 
Therefore,  the  weighted  residuals  are  given  by 

'’/(jc)  =  w.  [(I)(r.,x) -d.],  i  =  l,2,...,m  (4) 

with  the  weighting  factors  w,  representing  the  users  preference  and  a  least  squares  solution  is  sought  by  minimizing 

/(*)  =  ^  E  '■.(*)"  =  ^ &(xfR(.x)  (5) 

2  2 

The  first-derivative  matrix  of  R(x)  is  simply  the  Jacobian  matrix  J(x)  with  (/j]) -element  J(x)ij==dr/x)/dxj,  The  gradient 
of  f(x)  is 


g(*)  =  mx)-'£  r.(x)-Vr,(x)  =J(xfJl(x) 


Similarly,  the  Hessian  of  /{xj  is 


G(x)  =  V^/(x)  =  ^  [Vr,.(j:)-Vrj(i/  *  r,(i)-VV,.(*)]  =  J(x)  *  S(x) 


S(x)  =  5^  r,(x)-V^rj(*) 

1  =  1 

denotes  the  second-order  information  in  vy(xj.  Thus,  from  a  truncated  Taylor  series  expansion 

/(aCj^  +  6)  q^(5)  =4  +  +  V2 

the  quadratic  model  of  /fxj  around  Xj^  is  given  by 

?j(S)  =  1  R(x,/R(Xi,)  *  S^J(x^fR(x,)  *  1  d^(J(x,fJ(x^)->-S(x^))5 
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for  the  minimization  of  objective  flmction  of  eqn  (5).  The  full  Newton  step  is  given  by 

It  is  often  possible  to  ignore  the  second-order  term  SfXjJ  of  the  Hessian  altogether  on  the  grounds  that  if  the 
nonzero  residuals  are  not  of  the  type  that  reinforce  their  nonlinearity  [4],  then  J(x/J(x)  is  a  sufficiently  good  Hessian 
approximation.  In  the  resulting  Gauss-Newton  method,  the  ‘Newton  step’  from  x^^  is  defined  by 

-  [J(.XtfJ(x,)]-'j(x^fR{x,) 

Difficulties  arise  when  the  Hessian  matrix  is  not  positive  definite.  To  solve  this  problem,  an  idea  of  modifying 
matrices  by  addition  of  a  multiple  of  the  unit  matrix  was  suggested  at  an  early  date  by  Levenberg  [5]  and  Marquardt 
[6]  and  applied  to  Newton’s  method  by  Goldfeld  et  al  [7],  Hence,  the  step  5^  is  determined  by  solving  the  system 

(G,  +  X/)8^=-g,  ,  X.O  (13) 

When  Gk  is  close  to  being  positive  definite  then  it  may  only  be  necessary  to  add  a  small  multiple  of  unit  matrix  I  so 
as  to  give  a  good  direction  of  search.  All  Levenberg-Marquardt  algorithms  therefore  find  a  value  X>0  such  that  G^+XI 
is  positive  definite  and  solve  eqn  (13)  to  determine 

Difficulties  arise  again  when  the  underlying  quadratic  function  obtained  by  truncating  the  Taylor  series  for 
thef(x^+d),  does  not  have  a  unique  minimizer  and  the  method  is  not  defined.  Another  way  of  regarding  this  fact  is  that 
the  region  about  x^  in  which  the  Taylor  series  is  adequate  does  not  include  a  minimizing  point  of  A  more  realistic 
approach  therefore  is  to  assume  some  neighbourhood  of  x^  is  defined  in  which  q^(b)  agrees  /(x^+b)  in  some 
sense.  Then  it  would  be  appropriate  to  choose  Xk+}=Xf+bi„  where  the  correction  b(x)  minimizes  for  all  Xj^+b  in  12^. 
Such  a  method  is  called  a  restricted  step  method  or  sometimes  a  trust  region  method',  the  step  to  a  new  iterate  is 
obtained  by  minimizing  a  local  model  to  the  objective  function  over  a  restricted  ellipsoidal  region  of  validity  of  the 
Taylor  series  centred  about  the  current  iterate.  The  diameter  of  this  region  is  expanded  and  contracted  in  a  controlled 
way  based  upon  how  well  the  local  model  predicts  behaviour  of  the  objective  function.  It  has  been  shown  [8]  that  this 
method  retain  the  rapid  rate  of  convergence  of  Newton’s  method,  but  is  also  generally  applicable  and  globally 
convergent.  Let  be  the  radius  of  the  neighbourhood.  Hence 


and  seeking  the  solution  6^^ 


minimize 

b 


5  I  ^  P* 


using  a  double  dogleg  search  technique  instead  of  a  line  search. 

Three  optimization  algorithms  (Gauss-Newton,  Levenberg-Marquardt  and  Trust  region  methods)  have  been 
compared  and  tested  on  18  different  functions  recommended  by  More  [9]  for  nonlinear  least-square  problems.  Details 
of  this  comparison  may  be  found  in  a  recent  conference  publication  by  the  authors  of  this  paper  [3], 


Global  optimization  with  simulated  annealing 

In  recent  years  a  method  known  as  ‘Simulated  Annealing’  has  been  widely  used  in  combinatorial  optimization  with 
classical  direct  search  methods  to  seek  the  global  minimum  of  a  function  f(x).  It  is  essentially  an  iterative  random 
search  procedure  with  adaptive  moves  along  the  coordinate  directions.  The  algorithm  permits  uphill  moves  under  the 
control  of  a  probabilistic  criterion,  thus  tending  to  avoid  the  first  local  minima  encountered.  First,  it  reaches  an  area 
in  the  function  domain  where  a  global  minimum  should  be  present,  following  the  gross  behaviour  irrespectively  of  small 
local  minima  found  on  the  way.  It  then  develops  finer  details,  finding  a  good,  near-optimal  local  minimum,  if  not  the 
global  minimum  itself.  This  process  is  based  on  an  analogy  from  thermodynamics  where  a  system  is  slowly  cooled  in 
order  to  achieve  its  lowest  energy  state.  The  algorithm  was  independently  introduced  by  Kirkpatrick  et  al  [10],  and 
Cemy  [11].  The  name  ‘simulated  annealing’  originated  from  analogy  with  the  physical  annealing  process  of  solids. 
Other  names  used  are  Monte  Carlo  annealing  [12],  probabilistic  hill  climbing  [13],  statistical  cooling  [14],  [15]  and 
stochastic  relaxation  [16].  An  important  feature  of  the  simulated  annealing  algorithm  is  that  it  finds  high-quality 
solutions  which  do  not  strongly  depend  on  the  choice  of  the  initial  solution,  i.e.  the  algorithm  is  effective  and  robust. 
However,  simulated  annealing  technique  is  expensive  because  many  function  evaluations  are  necessary  [3]. 
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Concept  and  main  features  of  the  optimization  shell 


After  careful  studies  of  the  different  optimization  algorithms  mentioned  above,  the  Levenberg-Marquardt  method  was 
chosen  and  implemented  in  an  improved  version  of  the  Electromechanical  Actuator  Modelling  and  OptimizatioN 
(EAMON)  shell  [2],  which  also  interfaces  with  the  finite  element  package  OPERA-2d.  Fig.  2  shows  a  block  diagram 
of  the  open  architecture  system  of  EAMON,  The  graphical  user  interface  has  been  programmed  using  MATLAB’s 
toolkit.  This  eliminates  the  need  for  long  hours  of  programming  in  C-language  and  gives  the  flexibility  to  work  at  a 
higher,  more  productive  level  so  that  more  can  be  done  more  quickly  and  with  greater  accuracy,  For  the  same  reason 
the  optimization  routine  employing  Levenberg-Marquardt  algorithm  has  been  written  as  a  MATLAB  function  in  M-file 
format  (i.e.  optimizer.m).  Any  other  optimization  algorithm  which  can  be  written  in  M-file  format  can  replace  the 
chosen  Levenberg-Marquardt  algorithm. 

The  Open  Architecture 
of  EAMON 
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Fig.  2,  Block  diagram  of  EAMON 
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Another  important  feature  of  EAMON  shell  is  its  capability  to  interface  with  a  general  purpose  finite  element 
package,  i.e.  OPERA-2d,  without  the  need  of  modifying  this  commercial  software.  Script  files  for  various  displacements 
of  solenoid  armature  may  be  quickly  and  easily  created  for  any  predefined  actuator  model  without  any  user  intervention. 
Creation  of  batch  files  to  run  finite  element  analysis  is  also  automated.  Finally,  the  force  analysis  using  Virtual  Work 
Principle  is  conducted  automatically  after  extensive  checks  for  availability  of  legitimate  finite  element  solutions  and 
OPERA-2d  command  files.  Any  design  changes  to  the  actuator  model  can  be  made  easily  and  quickly  and  the  whole 
CAD  procedure  up  to  force  analysis  can  be  executed  in  the  shortest  time  possible.  The  whole  CAD  operation  will  be 
invisible  to  the  user  except  for  a  window  indicating  "Pre-processing",  "Running  analysis"  or  "Post-processing"  stages 
of  a  finite  element  function  call.  Tlie  computational  flowchart  illustrating  this  feature  is  given  in  Fig,  3. 


Fig,  3.  Computational  flowchart 
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The  EAMON  shell  is  easy  to  use  and  very  user  friendly.  Pull-down  menus  appear  on  the  menu  bar  below  the  figure 
title  as  shown  in  Fig.  4.  Under  "Setup"  menu,  two  dialogue  boxes  can  be  created.  The  initial  design  variables  and  their 
bounds  can  be  assigned  easily.  The  desired  force  characteristic  can  be  specified  by  assigning  forces  for  the  various 
armature  displacements  i.e.  d,  and  tj  in  eqn.  (4).  Up  to  fifteen  displacements  can  be  given,  not  necessarily  in  regular 
intervals.  In  addition,  the  weighting  factor  may  also  be  given.  The  design  variables  and  force-displacement 
characteristic  will  always  default  to  the  latest  updated  values.  Under  "Options"  menu,  frequently  changed  parameters 
for  the  optimizer  routine  can  be  set;  I.e.  the  maximum  number  of  function  evaluations;  the  termination  criteria  that  is 
a  measure  of  the  worst  case  precision  required  of  the  design  variables  x\  and  the  termination  criteria  that  is  a  measure 
of  the  precision  required  of  the  objective  function  /  at  the  solution.  The  optimization  does  not  terminate  unless  both 
termination  criteria  have  been  met  or  when  the  maximum  number  of  function  evaluations  has  been  exceeded. 

It  is  useful  to  have  the  "Stop"  and  "Restart"  buttons  so  that  the  optimization  software  can  be  interrupted  and 
restarted  easily  without  loss  of  accumulated  information.  Before  starting  a  new  iteration,  the  optimization  code  checks 
if  the  "Stop"  button  has  been  clicked  to  see  whether  the  user  wants  to  interrupt  the  computation.  Users  can  let  an 
optimization  routine  run  for  a  small  number  of  iterations,  then  look  at  the  progress  of  the  calculation  in  order  to  decide 
whether  to  let  it  run  further.  The  shell  plots  force-displacement  graphs  (required  characteristic  as  x’s,  initial 
characteristic  in  solid  line  and  latest/final  characteristic  in  dotted  line)  after  every  iteration  and  automatically  assigns 
units,  title  and  axis  labels  so  that  the  user  instantly  gets  the  right  presentation.  A  hardcopy  can  be  produced  by  simply 
typing  "print"  in  MATLAB  command  prompt.  In  addition,  optimization  results  (i.e.  number  of  function  calls,  number 
of  gradient  evaluations,  and  value  of  the  objective  function  /  at  the  last  evaluated  point)  will  be  displayed  in  the 
"Comment  window". 


Comment  V/indow 


Pres?  Setup  at  the  top  to  assign  design 
variables,  their  limits  and  desired 
force  characteristic.  Parameters  for  the 
optimizer  is  set  in  Options. 


Fig.  4.  The  EAMON  base  window 


Examples 

Example  I  A  d.c.  solenoid  actuator  with  truncated  cone  pole  face  is  to  be  optimized  to  produce  a  specified  force- 
displacement  characteristic.  The  five  constrained  design  variables  are  shown  in  Fig.  5.  Their  bounds,  initial  and 
optimized  values  are  tabulated  below. 
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design 

variables 

lower 

bounds 

upper 

bounds 

initial  values 

optimized  values 

0 

3.1 

1.2975 

1.2919 

X2 

0 

45 

12.5706 

13.9323 

3 

8 

5.6039 

5.5942 

0 

10 

5.2694 

1.62)0 

X, 

0 

3.1 

2.9056 

2.9256 

The  optimization  terminated  successfully  after  18  iterations  and  108  function  evaluations.  Fig.  6  shows  the  required 
forces,  and  the  initial  and  final  force-displacement  characteristics, 


Fig.  5.  A  section  of  solenoid  actuator  with  five  design  variables  to  be  optimized. 
lOOr 
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Fig.  6.  Required  (x’s),  initial  (solid  line)  and  final  (dashed  line) 
force-displacement  characteristics. 

Example  2  The  new  objective  in  this  example  is  to  maximize  the  area  under  the  force-displacement  curve  by  varying 
the  radius  of  the  armature  (Fig.  7)  while  maintaining  the  external  diameter  of  the  device  constant.  The  following 
assumptions  are  made  : 

1.  the  maximum  power  loss  in  the  coil  is  24  watts  for  a  24  volts  d.c.  supply; 

2.  the  maximum  current  density  in  the  copper  wire  at  I00°C  temperature  rise  is  assumed  10.67  A/mm^;  and 

3.  the  solenoid  coil  space  factor  X  Is  0,6353. 

The  choice  of  ampere-turns  in  the  coil  is  limited  by  assumption  (1)  or  (2)  whichever  is  less.  For  example,  for  a  design 
variable  jc=8mm,  the  mean  radius  of  coil,  r„,  =  1 1.1625  mm,  and  the  cross-section  area  of  coil,  A=\S\  mm^. 


383 


Assuming  "single  turn"  coil,  the  coil  resistance  is  given  by 

Pjoq/  (1.72xl.393xlO-5)2Tir„ 

- Ia - 


1.461381  xlO'5 


where  p,o„  is  resistivity  of  copper  at  100°C  temperature  rise,  /  is  the  length  of  the  coil  turn  and  A^.  cross-section  area 
of  copper  only.  The  excitation,  assuming  a  power  loss,  P=24W,  at  24V,  is 


(N/)  = 


1281.5  At 


The  excitation  assuming  maximum  current  density  in  copper  is  (NI)  =  10.67  x  A^.  -  1226.94  At.  Hence,  the  solenoid 
actuator  is  modelled  with  1226.94  At  excitation.  Consequently, 
power  loss,  P  =  {Nif  R,  =  21.9992  W; 
coil  current,  I  =  916.634  mA; 
number  of  turns,  N  =  1338;  and 
actual  coil  resistance,  R  -  26.1623  Q. 

The  design  variable  (radius)  has  a  lower  bound  of  6.5  mm,  upper  bound  of  9.5  mm  and  initial  value  of  9.2  mm.  The 
initial  area  under  the  force-displacement  curve  is  0.170557  Nm.  The  optimization  terminated  successfully  after  19 
iterations  and  36  function  evaluations  with  x  =  8.2157  mm,  NI  -  1 168.45  At,  P  =  21.36  W,  and  area  under  the  force- 
displacement  curve  =  0.191805  Nm.  Fig.  8  shows  the  initial  and  final  force-displacement  characteristics. 


frame  coil 


Fig.  7,  Solenoid  actuator  to  be  optimized. 


Fig.  8.  Initial  (solid  line)  and  final  (dashed  line) 
force-displacement  characteristics. 
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Abstract 

Genetic  Algorithms  (GAs)  are  search  methods  that  mimic  some  fundamental  functions  of  nature’s 
biological  evolution.  GAs  share  natural  systems’  high  level  of  robustness  as  result  of  their  being 
able  to  adapt  to  many  different  environments  and  to  locate  the  global  optimum  without  being 
attracted  to  the  local  extremes.  This  study  presents  an  application  of  GAs  to  the  design  opti¬ 
mization  of  electromagnetic  devices.  The  developed  optimization  system  consists  of  the  search 
tool  and  an  analysis  tool.  The  Unite  element  method  (FEM),  serving  as  the  analysis  tool  is 
incorporated  with  the  GA.  The  objective  function  is  formed  from  the  computed  field  parameters 
and  some  penalty  terms  usually  derived  from  the  geometrical  constraints.  The  method  is  applied 
to  a  size  optimization  problem.  The  results  are  highly  satisfactory  and  suggest  the  inclusion  of 
the  GAs  in  the  search  toolbox  of  a  universal  optimization  environment. 


Introduction 

John  Holland  invented  GAs  in  the  early  70s  with  the  goal  of  creating  a  computer  algorithm  that 
would  mimic  some  functions  of  natural  evolution.  If  the  adaptive  and  selective  mechanism  of 
evolution  can  be  reflected  upon  an  algorithm,  this  algorithm  can  then  be  used  to  solve  complex 
search  problems.  The  applications  that  have  appeared  since  the  arrival  of  the  technique  prove  that 
GAs  are  powerful  in  solving  complicated  problems  the  way  nature  does.  These  applications  cover 
a  wide  range  in  the  areas  of  optimization  and  machine  learning  problems,  including  scheduling, 
pipeline  system  optimization,  image  processing  and  pattern  recognition,  computer  vision,  VLSI 
circuit  layout,  structural  optimization,  adaptive  Alter  design,  nonlinear  equation  solving,  and 
neural  network  design  and  training  [1].  Recently,  design  optimization  of  electromagnetic  devices 
has  been  added  to  this  list  by  the  authors  [2]. 

In  nature,  evolution  works  on  the  chromosomes  of  individuals  in  the  population,  not  on 
the  individuals  themselves.  The  information  contained  in  the  chromosomes  is  decoded  into  the 
individuals  in  the  population.  The  combinations  of  genes  determine  the  characteristics  of  an 
individual.  The  rules  imposed  by  the  environment  govern  the  survival  of  the  population  and, 
therefore,  in  which  genetic  direction  it  evolves.  In  a  population,  the  chromosomes  that  decode 
into  more  successful  individuals  reproduce  more  often.  The  less  successful  ones,  and  chromosomes 
representing  them,  decline  in  number  because  they  cannot  appropriately  adapt  to  the  conditions 
of  the  environment.  Evolution  in  nature  is  an  ongoing  process.  It  is  dynamic  because  the 
environment  changes  continuously.  If  it  is  assumed  that  the  rules  of  the  environment  do  not 
change  for  a  sufficiently  long  period,  eventually  the  majority  of  the  members  of  a  population  will 
be  weU  fitted  to  the  environment. 
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In  a  GA,  three  of  nature’s  functions  of  biological  evolution  are  simulated.  These  are  repro¬ 
duction,  recombination,  and  mutation.  In  the  algorithm,  chromosomes  are  represented  by  finite 
length  character  strings,  usually  made  up  of  Os  and  Is.  Simulated  evolution  works  on  a  population 
of  strings  by  means  of  its  operators.  The  algorithm  does  not  have  any  information  about  the 
problem  it  is  trying  to  solve.  The  only  information  that  is  provided  to  it  is  how  the  chromosomes 
are  decoded  into  functional  individuals  of  the  population  and  how  successful  each  individual  is  in 
adapting  to  the  environment  [1,  3]. 

Goldberg  specifies  four  main  aspects  that  make  the  GAs  different  from  other,  more  conven¬ 
tional,  search  methods: 

1.  They  operate  on  a  population  of  points  in  the  search  space  simultaneously,  not  just  on  one 
point. 

2.  They  work  with  a  coded  string  representing  the  parameters,  not  the  parameters  themselves. 

3.  They  only  use  the  objective  function  value. 

4.  Their  rules  for  transition  are  probabilistic. 

Development  of  a  Genetic  Algorithm 

In  the  way  it  mimics  the  natural  selection  rule  of  evolution,  the  GA  offers  applicability  to  any 
optimization  or  search  problem.  Before  running  the  actual  program,  however,  there  are  four 
primary  steps  in  preparing  to  solve  a  problem  using  GAs  [4].  Prior  to  solving  a  problem  using 
GAs,  the  user  must  determine  the  representation  scheme,  the  fitness  measure,  the  stopping  rule, 
and  the  parameters  of  GA’s  operation. 

The  representation  scheme  is  the  way  the  design  variables  are  coded  into  the  finite-length 
character  strings.  An  alphabet  of  cardinality  K  is  used.  Although  some  researchers  choose  to 
use  higher  cardinality  alphabets,  at  least  in  Holland’s  sense,  the  alphabet  of  the  representation 
scheme  is  binary;  K  —  2,  whose  elements  are  ‘0’  and  ‘1.’  The  coding  of  the  chromosomes  is  a 
very  important  concept  directly  affecting  the  performance  of  the  GA.  It  is  user  and  problem  de¬ 
pendent.  Binary  or  Gray  coding  are  possible  examples.  The  representation  scheme  is  responsible 
for  “mapping”  every  point  in  the  search  space  onto  a  unique  chromosome  [4]. 

The  fitness  measure  is  an  indication  of  how  successful  an  individual  is.  The  fitness  measure 
is  assigned  to  the  chromosomes.  Every  chromosome  that  exists  during  evolution  is  evaluated  and 
its  fitness  measure  is  calculated.  The  fitness  measure  is  the  only  information  the  GA  has  relating 
the  population  to  the  environment.  In  an  optimization  problem,  the  environment  is  the  objective 
function.  The  objective  function  value  or  a  function  of  it  can  be  used.  The  definition  of  the 
fitness  measure  is  also  user  and  problem  dependent. 

GA’s  search  can  continue  indefinitely.  Even  if  the  population  reaches  the  global  optimum, 
mutations  account  for  a  genetically  dynamic  population.  Therefore,  a  stopping  rule  is  necessary. 
Some  of  the  popular  methods  are  to  fix  the  number  of  generations  and  to  use  the  best  individual  of 
all  generations  as  the  optimum  result;  to  fix  the  time  elapsed  and  to  select  the  optimum  similarly; 
to  let  the  entire  population  converge  to  an  average  fitness  with  some  error  margin. 

The  parameters  of  the  GA  shape  the  way  the  algorithm  runs.  There  are  two  primary  param¬ 
eters.  Population  size,  p,  is  the  number  of  chromosomes  in  the  population;  chromosome  length, 
I,  specifies  the  number  of  characters  (genes)  used  to  form  one  character  string.  There  are  two 
secondary  parameters  that  define  the  occurrence  probabilities  of  the  two  of  the  three  main  GA 
operators.  Crossover  rate,  p^  specifies  the  occurrence  rate  of  the  chromosome.  It  is  measured 
per  chromosome  pair;  mutation  rate,  Pm,  defines  the  occurrence  rate  of  a  mutation  per  gene. 

The  procedure  given  below  is  used  to  obtain  the  results  presented  in  this  paper.  There  are 
many  variations  of  implementing  a  GA.  Different  versions  of  algorithms  with  different  perfor¬ 
mances  can  be  synthesized  by  manipulating  either  the  algorithm  itself  or  its  functions,  or  both. 
The  algorithm  presented  here  is  the  canonical  GA  of  Holland  and  De  Jong  [5,  1]. 
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1.  Randomly  create  an  initial  population  of  p  chromosomes. 

2.  Evaluate  each  chromosome  and  assign  fitness  measures. 

3.  If  the  stopping  criterion  is  satisfied,  stop. 

4.  With  the  reproduction  operator,  create  a  new  population. 

5.  Randomly  pair  the  chromosomes  to  form  parents.  By  means  of  the  crossover  operator, 
mate  them  to  form  ofiFspring. 

6-  Apply  mutation  to  preserve  the  genetic  diversity. 

7.  Go  to  Step  2. 

In  the  above  procedure,  three  fundamental  operators  of  the  GAs  are  introduced;  1)  reproduc¬ 
tion,  2)  crossover  and  3)  mutation. 

Based  on  the  fitness  measure  of  a  chromosome  and  the  average  fitness  of  the  population,  the 
reproduction  operator,  also  called  the  selection  operator,  determines,  rather  stochastically,  the 
number  of  copies,  if  any,  that  particular  chromosome  wiU  have  in  the  next  generation.  There  are 
many  ways  of  designing  this  operator;  however,  the  underlying  idea  is  to  give  the  chromosomes 
with  higher  fitnesses  more  chance  to  be  represented  in  the  next  generation  but  leave  the  decision 
to  a  random  variable.  Through  reproduction  a  new  generation  of  the  same  number  of  members 
is  generated  from  the  previous  population. 

Once  the  reproduction  is  complete,  the  chromosomes  exchange  information  through  the 
crossover  operator.  The  crossover  operator  simulates  the  recombination  process  of  natural  evo¬ 
lution.  It  mates  two  parents  to  create  two  offspring. 

Crossover  occurs  at  a  rate  determined  by  pc,  the  crossover  probability.  This  GA  parameter 
is  defined  before  running  the  algorithm.  The  information  exchange  between  the  chromosomes 
is  performed  by  swapping  the  bits  of  two  parents’  chromosomes.  After  the  individuals  of  the 
population  are  mated,  for  each  pair,  aU  bits  from  a  randomly  selected  position  on  the  chromosome 
to  the  end  of  the  string  are  swapped.  In  uniform  crossover,  there  is  no  positional  bias  in  swapping 
the  bits.  This  implies  that  there  are  crossover  methods  where  the  crossover  probability  of  a  bit 
depends  on  the  position  of  that  bit  in  the  chromosome.  Another  variation  is  to  use  multiple 
point  crossover,  in  which  bits  positioned  between  randomly  selected  sites  on  the  chromosomes 
are  swapped  [6].  As  an  example  of  the  one-point  crossover,  suppose  the  coding  is  binary  with 
an  alphabet  A  —  {0, 1}.  Assume  that  before  crossover  two  strings,  ui  and  3,2  are  mated.  Also 
assume  that  the  crossover  site  for  this  pair  is  3,  as  indicated  above  with  the  symbol  The 
effect  of  crossover  is  shown  below. 

before  crossover  after  crossover 

ui  =  110|010110  a[  =  110|110001 

02  =  ioo|iioooi  02  =  loojoioiio 

The  third  fundamental  operator  of  GAs  is  the  mutation  operator.  Although  reproduction 
and  crossover  both  operate  at  the  chromosome  level,  mutation  operates  at  the  gene  level.  It 
occasionally  changes  the  value  of  a  gene  acting  as  a  protector  against  the  complete  loss  of  some 
important  genetic  information  by  providing  genetic  diversity  [4].  Mutation  is  a  background  oper¬ 
ator.  Associated  with  the  mutation  operator,  there  is  an  occurrence  probability,  pm,  determined 
along  with  the  other  GA  parameters.  This  probability  is  per  gene  not  per  chromosome.  The 
usually  high  total  number  of  genes  in  the  population  combined  with  the  background  nature  of 
the  mutation  operator,  Pm,  is  usually  a  small  number. 
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Figure  1:  Geometry  of  the  pot  core  transformer  example 


Pot  core  problem 

The  example  used  to  demonstrate  the  application  of  the  GAs  to  the  design  optimization  of 
electromagnetic  devices  is  a  pot  core  transformer.  Mohammed  and  Jones  provided  a  solution  for 
this  problem  using  a  dynamic  search  method  [7].  Here  the  same  geometry  is  used  to  provide 
grounds  for  comparison  of  the  results.  Shown  in  Figure  1,  the  design  is  symmetrical  along  the 
vertical  line  in  the  middle.  Therefore  only  one  half  of  the  problem  is  modeled  and  solved-  In 
the  original  problem,  the  magnetic  material  has  a  nonlinear  characteristic.  In  this  application, 
however,  the  iron  is  treated  as  a  linear  material  and  a  linear  solution  is  obtained.  The  permeability 
used  is  obtained  from  the  actual  B  ~  H  curve  where  the  linear  region  ends  as  /x^  =  1167.14. 
Although  Mohammed  and  Jones  took  them  into  account,  eddy  currents  are  not  considered  here. 
Due  to  these  reasons,  the  comparison  of  the  two  methods  should  not  be  quantitative. 


Design  variables 

The  six  design  variables  indicated  on  the  model  in  Figure  1  are  tabulated  in  Table  1.  The  ranges 
of  these  variables  are  based  on  the  two  sets  of  values  given  by  Mohammed  and  Jones  as  being 
starting  and  end  points  of  the  search.  Here,  lower  limits  are  allowed  to  be  less  and  upper  limits  are 
allowed  to  be  higher  than  those  values.  A  constraint  on  the  genetic  operators  are  implemented  to 
assure  that  parameters  do  not  overlap)  such  as  having  a  coil  that  is  higher  than  the  transformer 
itself.  These  ranges  are  also  given  in  Table  1. 


Objective  function 

The  objective  of  the  optimization  is  to  reduce  the  size  of  the  device  while  maintaining  a  certain 
magnetic  flux  density.  The  size  of  the  device  is  directly  dependent  on  the  cross-sectional  area 
modeled.  The  cross-section  can  be  calculated  from  the  design  parameters.  A  criterion  is  the 
magnetic  flux  density.  For  each  design,  the  average  magnetic  flux  density  in  the  air  gap  is 
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Design  Variables 

lower  limit  (mm) 

Upper  limit  (mm) 

D 

total  width  of  the  transformer  (half  is  modeled) 

11.0 

15.0 

C 

height  of  the  transformer 

5,0 

9.0 

H 

height  of  the  coil 

1.8 

3.5 

W 

width  of  the  coil 

3.0 

6,5 

R 

distance  of  the  coil  from  the  center 

2.0 

3.5 

A 

width  of  the  air  gap 

0.007 

0.4 

Table  1:  Design  variables  of  the  pot  core  problem  and  their  ranges 


calculated  and  compared  with  the  value  calculated  for  the  smaller  device  proposed  by  Mohammed 
and  Jones. 

These  two  comparisons,  area  and  flux  density,  yield  to  two  fitness  parameters  which  have  to 
be  combined  to  form  one  fitness  value.  This  is  done  using 


where 


^  ^  flux  "1”  ^area 


^flux  =  10  log 


I B  desired  B  calculated  \ 


(1) 

(2) 

(3) 


The  logarithm  is  used  because  the  reciprocal  of  the  difference  in  the  flux  densities  may  become 
high.  The  coefficient  should  bring  the  weights  of  the  parameters  to  the  same  order. 


GA  parameters 

The  GA  parameters  set  used  for  the  pot  core  optimization  problem  is  given  in  Table  2.  Each 
of  the  30  individuals  are  made  of  48  bits  representing  6  design  variables  with  8  genes  each. 
The  representation  scheme  is  Gray  coding.  A  crossover  probability  of  0.8  is  used  with  uniform 
crossover.  The  mutation  rate  is  0.008.  The  process  is  stopped  after  100  generations  and  the  best 
individual  is  picked  as  the  optimal  slotion. 


Results  of  pot  core  problem 

The  final  values  of  the  6  design  parameters  are  given  in  Table  3.  In  the  same  table,  these  values 
are  compared  to  Mohammed  and  Jones’  results,  which  utilized  dynamic  search  to  reach  these 
values  [7].  In  Figure  2  the  change  of  the  fitness  of  the  best  individual  is  shown.  The  very 
high  fitness  that  appears  in  the  first  and  following  three  generations  is  not  taken  into  account 
as  that  may  be  results  of  a  premature  convergence  phenomenon.  As  the  iterations  continue, 
this  unreliable  peak  disappears.  However,  one  can  argue  that  this  particular  individual  has  the 
best  fitness  and  therefore  must  be  selected  as  the  optimum  result.  In  the  results  shown  here, 
nevertheless,  the  best  fitness,  which  appears  toward  the  end  of  the  100  generations  is  chosen  as 
the  optimum. 

In  Figure  3  the  field  solution  is  shown  with  isopotential  lines.  The  air  region  that  surronds 
the  transformer  is  also  modeled.  For  the  demonstration  purposes,  however,  these  regions  are  left 
out  of  the  figures.  As  one  of  the  objectives  is  to  reduce  the  size  of  the  device,  graphing  the  best 
size,  the  cross-sectional  area  in  this  case,  against  the  generations  provides  valuable  insight  into 
how  the  search  proceeds.  This  is  shown  in  Figure  4  The  peak  of  the  first  several  generations  is 
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GA  Parameters 

Number  of  individuals  (/z) 

30 

Number  of  chromosome  segments 

6 

Segment  length 

8 

Chromosome  length 

8 

Representation  scheme 

Gray  code 

Fitness  scaling 

linear 

Reproduction  method 

remainder  stochastic  without  replacement 

Crossover  method 

uniform 

Crossover  probability  (pc) 

0.8 

Mutation  probability  (pm) 

0.008 

Termination  criteria 

100  generations 

Table  2:  GA  parameters  of  the  pot  core  optimization  problem 


Design  variable 

Optimized  value  (mm) 

Dynamic  search  results  (mm) 

D 

11.0471 

12.303 

C 

5.87843 

6.655 

W 

1.96 

2.349 

H 

3.76863 

4.724 

R 

2.95294 

2.349 

A 

0.02087 

0.089 

Table  3:  Final  values  of  the  design  variables  and  comparison  to  the  results  from  the  dynamic 
search  method 


Figure  2:  Change  of  the  best  and  average  fitness  values  over  generations 
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observed  to  represent  a  bigger  size  than  than  that  of  the  selected  peak  as  the  optimal  solution. 
This  implies  that  the  first  peak  is  a  result  of  a  better  match  in  the  other  criterion,  the  magnetic 
field. 

It  must  be  noted  here  that  the  comparison  with  the  dynamic  search  method  is  to  establish  a 
performance  pattern  for  GAs  with  a  qualitative  insight.  The  GA  solution  is  linear  and  does  not 
include  eddy  currents  and  losses  as  the  solution  by  dynamic  search  did. 


Conclusion 

Genetic  algorithms  and  the  finite  element  method  together  are  applied  to  the  design  optimization 
of  an  electromagnetic  device. 

GAs  are  search  methods  that  provide  a  high  level  of  robustness  by  simulating  nature’s  charac¬ 
teristic  of  adapting  to  many  different  environments.  Categorized  under  stochastic  search  methods, 
GAs  are  distinguished  from  other  random  search  techniques  by  their  highly  structured  informa¬ 
tion  exchange  schemes.  The  algorithm  explores  the  search  space  and  exploits  the  similarities 
between  the  traits  of  the  chromosomes  to  direct  the  search.  GAs  perform  especially  well  for  mul¬ 
tidimensional  search  problems  with  nonconvex  and  nondifferentiable  objective  functions.  They 
base  the  search  only  on  the  fitness  values  and  do  not  rely  on  the  derivatives  of  the  objective 
function  or  any  other  auxiliary  information.  GAs  are  fully  parallelizable. 

The  performance  of  GAs  is  observed  to  be  quite  high.  For  complicated  problems,  the  robust 
scheme  results  in  comparable  speed  and  guaranteed  good  solution.  Some  savings  on  the  CPU 
time  are  achieved  by  keeping  the  fitness  history  of  the  previous  generation. 
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ABSTRACT 

Classically,  the  electromagnetic  field  problem  is  posed  as  a  direct  one  in  which  the  device  is  given  and  the 
performance  of  the  device  is  predicted  using  numerical  analysis  techniques.  However,  in  real  engineering 
electromagnetic  applications,  the  problem  is  posed  as  an  indirect  one  where  the  performance  is  given  and 
the  device  optimum  design  parameters  are  desired.  In  the  design  optimization  procedure,  the  indirect 
problems  are  also  known  as  inverse  problems  and  are  solved  by  optimizing  an  object  function,  which  is 
based  on  the  given  performance,  with  respect  to  some  chosen  design  parameters.  Nowadays  deterministic 
and  nondeterministic  methods  are  used  for  the  optimization  procedure. 

This  paper  will  present  an  overview  of  the  deterministic  gradient  based  optimization  technique.  Further 
difficulties  in  implementing  the  technique  will  be  discussed  and  the  results  of  two  case  studies  employing 
linear  constraints  will  be  presented. 


INTRODUCTION 

The  conventional  approach  in  solving  engineering  design  problems  can  be  classified  as  direct  one.  The 
designer  chooses  an  initial  design  and  modifies  the  design  in  such  a  way  to  satisfy  an  expected 
performance.  The  modification  is  based  on  some  heuristic  rules  and  it  is  a  trial  and  error  method  [IJ.  The 
major  drawback  of  this  approach  is  that  the  final  design  hardly  satisfies  the  expected  performance.  In 
addition  to  this,  the  heuristic  rules  are  based  on  experience  and  they  are  not  sufficient  enough  to  achieve  an 
optimum  design  [2].  Recently,  the  deterministic  and  several  non-deterministic  approaches  for  design 
optimization  were  reported  in  the  literature.  Non-deterministic  methods  do  not  use  the  gradients  for  the 
optimization.  Even  though  they  do  not  use  the  computationally  expensive  gradient,  they  may  also  become 
expensive  for  a  design  problem  with  large  number  of  parameters.  However,  the  application  of  these  non- 
deterministic  methods  cannot  be  avoided  for  the  inverse  problems  with  object  functions  which  have  a  global 
minimum  and  several  unrealistic  local  minima  [3,  4],  In  this  paper  the  gradient  based  deterministic 
approach  for  the  design  optimization  is  discussed. 

In  the  indirect  approach  the  design  problem  is  posed  as  an  inverse  one  where  the  aim  is  to  predict  the 
design  of  a  device  in  such  a  way  to  satisfy  a  given  performance  as  best  as  possible  [1-3,  5-7].  To  do  so  an 
initial  design  is  chosen  and  repeatedly  modified  in  a  formal  manner.  Here,  at  each  step,  the  modification  of 
the  device  is  done  based  on  some  analytical  methods  which  have  some  theoretical  background  [2].  In  the 
indirect  approach  the  problem  is  mathematically  formulated  as  an  inverse  problem  and  a  method  based  on 
the  sensitivity  analysis  is  applied  to  optimize  the  device  [1-3,  5-7]. 

In  this  work,  the  indirect  design  approach  for  solving  the  inverse  problems  is  used  in  the  design 
optimization  of  electormagnetic  devices.  To  perform  the  optimization  in  a  formal  manner, 

(i)  the  configuration  of  the  device  is  defined  in  terms  of  some  chosen  design  parameters,  p,  and, 

(ii)  the  performance  is  expressed  as  a  function,  f(p),  in  terms  of  some  electromagnetic  parameters 
such  as  capacitance,  inductance,  magnitude  and/or  phase  of  electromagnetic  fields,  etc.  Then  a 
suitable  expression  for  the  desired  performance,  fdCp),  is  also  chosen. 

(iii)  a  suitable  optimization  procedure  is  applied  to  find  values  of  p  in  such  a  way  that  the 
corresponding  f(p)  is  close  to  fd(p)  as  best  as  possible. 

(iv)  a  design  evaluation  criteria,  F(p),  which  is  based  on  the  comparision  of  f(p)  to  fdCp)  is  defined.  In 
practice,  the  evaluation  criteria  or  object  function,  F(p),  is  chosen  in  such  a  way  that  the  optimum 
design  satisfies  F(p)  at  its  extremum  point. 
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The  efficient  deterministic  optimization  techniques  are  based  on  the  computation  of  the  gradient  of  the 
function  F(p)  which  has  to  be  extremized.  In  other  words,  the  deterministic  methods  of  the  Mathematical 
Programming  requires  the  gradient  of  the  object  function  to  find  the  search  direction  in  the  optimization. 
To  reduce  the  cost  of  the  optimization  other  methods  such  as  Sequential  Approximate  Optimization  (SAO) 
were  used  [5]. 


Sequential  Approximate  Optimization  (SAO) 

In  the  60's  and  70's,  the  gradient  of  the  object  function  was  computed  by  perturbing  one  of  the  design 
parameters,  p  at  a  time  [1-3,  5-7].  It  is  clear  that  the  gradient  computation  requires  the  evaluation  of  F(p) 
hundreads  of  times  and  each  one  needs  expensive  field  computation.  Sequential  Approximate  Optimization 
(SAO)  overcomes  this  difficulty  by  analyzing  the  device  for  a  number  of  sets  of  values  p  and  predicting  the 
response  at  some  other  values  of  p  [5,  8,  9].  In  SAO,  F(p)  can  be  approximated  either  by 

(i)  fitting  a  curve  with  different  values  of  the  function,  f(p)  for  different  sets  of  p,  or  by 

(ii)  expressing  F(p)  as  a  Taylor's  series  expression  of  f(p)  with  respect  to  an  initial  design  with  the 
parameters,  poand  using  the  higher  order  derivatives  f(po). 

Fast  reanalysis  techniques,  which  are  among  the  SAO  use  the  field  solution  corresponding  to  p  to 
evaluate  the  field  solution  corresponding  to  the  parameter  values,  (p+Ap).  These  techniques  are  used  in 
static  and  eigenvalue  problems  [5].  In  this  approach,  the  field  corresponding  to  the  set  of  parameters  p  is 
evaluated  as  a  solution  of  the  global  system,  P(f)=Q.  Then  the  global  system  corresponding  to  the 
parameters,  (p+Ap)  is  written  in  terms  of  the  sensitivities,  9P/3p  and  3Q/3p,  and  the  difference  vectors  and 
Ap.  The  new  system  has  more  unknowns  than  the  equations  and  it  can  be  solved  for  the  Acj)  and  Ap,  if  one 
can  obtain  some  relationships  between  these  unknowns. 

SAO  give  reasonable  results  for  inverse  problems  having  small  number  of  design  parameters.  But  these 
techniques  may  become  expensive  for  designs  having  large  number  of  parameters.  In  the  application  of 
Fast  reanalysis  techniques  one  may  have  to  make  several  assumptions  to  derive  the  relationships  in  between 
the  unknowns.  These  assumptions  may  simplify  the  problem  too  much  and  therefore  the  optimum  design 
may  not  be  practically  implemented.  To  overcome  these  difficulties  the  Inverse  Problem  Methodology 
(IPM)  was  introduced  [6,7]. 


Gradient  based  Inverse  Problem  Methodology  (GIPM) 

The  Inverse  Problem  Methodology,  the  formal  method,  to  solve  the  inverse  problems  was  evolved  in 
1989  [6].  In  this  method,  the  simple  least  square  based  object  function,  F(p)  is  defined  in  such  a  way  that 
the  optimum  design  satisfies  F(p)  at  its  minimum.  In  most  of  the  applications,  the  object  function,  F(p),  is 
differentiable  with  respect  to  p.  The  methodology  uses  this  differentiable  property  of  the  local  matrix 
equation  of  the  differential  [7]  or  integral  methods  [10]  of  the  electromagnetic  field  computation. 
Eventually  the  global  matrix  equation,  P<|)  =  Q,  is  differentiated  to  obtain  the  derivative  of  F(p).  In  other 
words,  P(|)  =  Q  is  analytically  differentiated  with  respect  to  each  of  the  design  parameters.  In  the  direct 
approach  for  gradient  computation,  the  differentiated  system  is  solved  for  the  derivative  of  the  field  variable 
with  respect  to  the  parameters.  Then  the  gradient  of  the  object  function,  VF(p),  is  computed  using  these 
derivatives.  Manipulation  of  the  differentiated  system  results  in  a  matrix  equation  of  the  form,  P(^<|)/^p)  = 
R.  Therefore,  solving  for  the  gradient,  (^({)/^p)  is  simplified  as  solving  a  sytem,  P(0(|)/3p)  =  R  which  has  the 
same  matrix  P  as  that  of  the  original  global  matrix  equation,  P(j)  =  Q .  In  otherwords,  the  same  matrix 
equation  is  solved  several  times  for  different  right-hand-side  vectors.  The  IPM  reduces  the  cost  of  the 
gradient  computation  and  makes  the  deterministic  methods  applicable  for  the  design  problems  with  a  large 
number  of  parameters.  In  this  section  the  direct  approach  for  the  gradient  computation  is  introuduced; 
another  one  known  as  adjoint  approach  for  the  gradient  computation  will  be  discussed  later. 

Method  Impiementation 

As  it  was  mentioned  before,  the  first  step  of  implementing  the  Gradient  based  Inverse  Problem 
Methodology  (GIPM)  for  a  design  optimization  problem  is  choosing  p,  f(p)  and  F(p).  Then  a  set  of  initial 
values,  po  are  chosen  for  p  and  then  the  following  procedure  is  repeated  until  the  optimum  design  is 
obtained: 
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i.  Create  the  mesh  for  the  differential  or  integral  method,  form  the  global  matrix  equation  ,  P<1)  =  Q 
and  solve  for  the  electromagnetic  state  variable,  0. 

ii.  Evaluate  the  object  function,  F(p),  and  check  whether  the  design  is  optimum  or  not.  If  the  design  is 
an  optimum  one  then  stop. 

iii.  Evaluate  the  right  hand  side,  R,  corresponds  to  the  differentiated  system,  P(3(|)/dp)  =  R  and  solve 
for  the  gradient,  (3(j)/3p).  Then  compute  the  gradient,  VF(p)  using,  (9(j)/3p).  A  set  of  new  values  for 
the  design  parameters  are  given  by  pnew  =  P  -  ot(VF(p))' 

iv.  Repeat  the  steps  (i).  -  (iii).  until  the  optimum  is  reached. 

At  each  iteration  of  this  optimization  procedure,  the  design  parameters  vary,  so  a  new  mesh  has  to  be 
created  for  the  field  computation  in  step  (i).  The  published  literature  shows  at  each  step,  if  the  mesh  is 
created  independently  then  the  resulting  object  function  may  become  non  smooth  with  several  unrealistic 
local  minima  [1 1,  12].  In  [1 1],  the  necessity  of  imposing  some  regularity  constraints  implicitly  in  the  mesh 
by  maintaining  the  mesh  topology  unchanged  throughout  the  optimization  procedure  was  mentioned. 
Further,  in  [12],  it  was  proved  that  the  difficulty  of  having  the  local  minima  can  be  improved  by 
maintaining  the  mesh  topology.  . 

In  some  applications,  it  was  observed  that  the  optimum  design  obtained  with  the  above  mentioned 
regularity  constraints  may  not  be  smooth  enough  for  practical  purposes,  even  though  it  satisfies  the 
performance  criteria  [15,  16].  It  was  shown  that  this  difficulty  is  caused  by  the  independent  movement  of 
finite  element  mesh  nodes  which  are  having  the  coordinates  as  one  of  the  design  parameters  [15].  Design 
Element  Technique  (DET)  [15]  and  Structural  Mapping  Technique  (SMT)  [16]  are  used  to  overcome  this 
difficulty.  SMT  has  its  drawback  of  using  three  variables  per  each  node  of  the  finite  element  mesh.  In  the 
DET,  a  suitable  design  element  is  defined  by  grouping  some  of  the  finite  elements  and  the  movements  of 
the  nodes  of  the  design  element  are  not  independent  any  more.  In  other  words,  the  movement  of  the  nodes 
are  constrained  implicitly.  In  this  paper,  it  is  demonstrated  through  two  example  that  imposing  some  linear 
constraints  explicitly  on  these  surfaces  in  such  a  way  to  maintain  their  smoothness,  guamatees  a  realistic  and 
practical  optimum  design. 

Sensitivity  Derivatives 


In  the  previous  sections,  the  GIPM  was  introduced  as  a  formal  approach  to  solve  inverse  problems  by 
choosing  some  design  parameters,  p,  and  defining  an  object  function,  F(p),  based  on  the  given  performance 
criteria.  It  is  obvious  that  the  electromagnetic  fields  of  a  configuration  depends  on  p.  The  performance, 
f(p),  is  expressed  in  terms  of  some  electromagnetic  parameters.  Therefore,  simply  it  can  be  considered  as  a 
function  of  the  electromagnetic  field,  (|),  and  expressed  as  f(<t)(p),p).  Eventually,  the  object  function,  F(p), 
which  is  the  design  evaluation  criteria,  takes  the  form, 

F(p)  =  F(f(p),fci(p),p)  =  F(f((j)(p),p),fd((|)(p),p),p)  =  F((j)(p),p)  (1) 

and  the  most  simplest  form  of  F  is  a  quadratic  expression  which  compares  f(p)  and  fd(p)  in  least  squares 

Even  though  the  computation  of  the  finite-difference  approximations  of  sensitivities  are  simple  and 
accurate  [5],  they  are  computationally  expensive.  Therefore,  the  analytical  derivatives  are  frequently  used  in 
the  GIPM  to  reduce  the  cost. 

Using  the  chain  rule  of  differentiation,  the  derivative  of  the  object  function,  dF(p)/dp,  is  given  by 

dF  3F  3F  3(j)  (o) 

The  first  term  of  (2)  is  easy  to  obtain  or  in  most  of  the  cases  zero  [5,  7],  the  computation  of  the  second  term 
is  discussed  here. 

The  global  matrix  equation  of  the  differential  or  integral  method  is  of  the  form, 

P(|)  =  Q. 

Differentiating  (3)  with  respect  to  p  results. 


3(i)  dP  3Q  (4) 

Substituting  (4)  in  (2)  gives, 

dF  3F  (5) 

dp  “  dp  3(|)  ^  Ldp  "  dp 

Eq.  (4)  can  be  solved  for  (d(j)  /dp)  since  0  is  already  known  by  solving  (3)  and  the  other  two  sensitivity  terms 
are  computable  by  evaluating  the  sensitivities  of  the  local  matrices  element  by  element.  In  the  direct 
approach  for  the  gradient  computation,  (dF/dp)  is  evaluated  by  solving  (4)  for  (d({)/dp)  and  substituting  it  in 
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(2).  But  this  approach  becomes  inefficient  in  design  optimizations  with  large  number  of  p  because  (4)  has 
to  be  solved  n  (  =  number  of  parameters)  times  in  each  iteration  [3]. 

In  the  adjoint  approach,  the  following  equation  is  solved  for  the  adjoint  vector,  X,  corresponding  to  F 

then  the  gradient  is  computed  by  substituting  X  for  (3F/9(1))  P  in  (5): 

The  gradient  of  a  constraint,  G  can  be  evaluated  in  a  similar  way  and  it  is  obvious  that  the  adjoint  approach 
reduces  the  cost  as  it  requires  only  one  matrix  solution. 

Case  Studies 

In  this  work,  design  optimization  of  an  electromagnet  and  a  magnetic  pole  face  are  considered  as  examples. 
In  both  the  cases  the  objective  is  to  obtain  a  constant  flux  density  in  the  air  gap.  In  the  first  example  in 
addition  to  this  objective  minimization  of  the  leakage  flux  is  also  considered  as  a  second  objective. 

Design  Optimization  of  a  Magnetic  Circuit  •  ■  u 

The  objective  is  to  design  a  magnetic  circuit  for  a  constant  flux  density  of  B  =  l.OT  in  the  air  gap  with  a 
minimum  leakage  inductance.  The  design  parameters  chosen  for  the  optimum  design  are  shown  in  Fig.  1. 
Four  measurement  points  are  chosen  along  the  air  gap,  (shown  by  a  dashed  line  in  Fig.l)  then  the  values  of 
the  computed  and  desired  flux  densities  are  compared  in  the  least  squares  sense.  The  leakage  inductance  is 
given  by  [14] 

T  N[(Aa-Ab)-(Ac-Ad)]  (8) 

Meakage—  j 

where 

Aa.  Ab,  Ac  and  Ap  are  the  MVPs  at  the  points  A,  B,  C  and  D  respectively  (Fig.  1)  and  N  is  the 
number  of  turns. 

The  object  function  can  be  simply  defined  as 
F  =  F(B(p),A(p))  =Fi+F2 

where 


=  5S( 


rNf(AA-AB)-(Ac-AD)]1 


computed 


The  derivative  is  given  by 


■'LbL 


computed  ^  desiredj 


rN[(AA-AB)-(Ac-AD)n 

VdAA 

dAs^ 

/dAc 

dAjy\ 

L  I  J 

HdP 

-  dpj 

■(dP 

-  dpJJ 

^  |-Nr(AA-AB)-(Ac-AD)]j  i-NKAA-ABKAc-ADijja^  (9^) 

MVP  is  chosen  as  the  state  variable  for  the  FEM  analysis  and  at  any  point  in  the  solution  region  the  flux 


rN[(AA-AB)-(Ac-AD)]idI 


density,  B  is  calculated  using  the  curl  equation,  B=  V  x  A: 
dB  1  rrdA\d  /dA\  (  aA\a  / 

ap  =  bL  wJa?  W  ^  V  axj  J 

Within  a  first  order  triangular  element,  (aA/3x)  is  approximated  by  the  equation  [14], 
[bi  b2  bs]  [Ai  A2  Ab]^ 

[ci  C2  C3]  [Ai  A2  As]^ 
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Measurement  Points 


Fig.  1  :  Parametric  description  of  an  Electromagnet 


bi  =  (Yii  -  yi2)/(2A),  ci  =  (Xj2  -  Xii)/(2A)  i= 

(Xi,yi)  are  the  vertices  of  the  triangles 
A  is  the  area  of  the  triangle 

and  Aj  is  the  magnitude  of  the  MVP  at  the  i-th  vertex. 
Therefore, 


i=l,2,3,  il  =  i  mod  3  +1,  i2  =  il  mod  3  +1, 


l(^)=  A3]‘  +  [c,  C2  C3]  (lOe) 

Using  eqns.  (10a  -  lOe)  and  (9b),  (3B/3p)  and  then  (dF/dp)  can  be  evaluated.  Here  p  has  7  elements  with  no 
nonlinear  constraints,  so  application  of  the  adjoint  method  to  calculate  the  gradient  would  be  more  efficient 
computationally  in  comparison  to  the  direct  method. 

Do  the  parameters  have  to  be  constrained?  In  general,  the  FEM  is  applied  within  a  closed  boundary. 
In  the  GIPM,  at  each  iteration  the  mesh  is  changed  according  to  the  values  of  the  parameters.  The  values  of 
some  of  the  parameters  take  some  of  the  nodes  out  of  the  exterior  boundary  or  out  of  material  regions  of 
those  particular  nodes.  To  avoid  the  difficuties  in  modifying  the  mesh  during  the  optimization  procedure 
the  following  linear  constraints  are  imposed  on  the  parameters: 

a  >0.0,  b>0.0,  c>0.0,  g  >  0.0,  h  >  0.0,  I  >  0.0,  w  >  0.0  (lla) 

b-3w>0.0  (11b) 

c- 2a -2b  >0.0  (He) 

h-2b-g>0.0  (lid) 

h<1.75E-l  (He) 

In  this  case  study,  a  multi-level  optimization  is  implemented.  GEPM  is  initially  implemented  to  the  design 
having  initial  parameter  values  (given  in  the  first  Row  of  Table  1).  One  can  see  (in  Row  2  of  Table  1)  that 
the  parameter  h  reached  the  value  1.749E-1  during  that  optimization  procedure.  Even  though  the  active  set 


Table  1:  Optimization  Results 


5.000E-2  3.000E-1 
5.000E-2  3.000E-1 


4.994E-2 

4.992E-2 


3.000E-1 

2.999E-1 


g 


2.000E-3 

2.000E-3 

2.399E-3 
j  2.394E-3 


“570Tr0E-2  5.000E-2  3.000E-1  2.000E-3  l.OUOOEl  1.250E-1  l.OOOE-2  l.U30t-i  4.iD/n-z 

5.000E-2  5.000E-2  3.000E-1  2.000E-3  l.OOOOEl  1.749E-1  l.OOOE-2  8,476E-2  4.088E-2 

4.996E-2  4.994E-2  3.000E-1  2.399E-3  l.OOOOEl  1.749E-i  9.887E-3  3.186E-3  3.820E-2 

4.995E-2  4.992E-2  |  2.999E-1  2.394E-3  l.OOOOEl  1.749E-1  9.887E-3  3.148E-3  3.820E-2 

Row  1:  Initial  Parametrs  and  the  values  of  the  object  functions,  Fi  and  F2  (eq.  9a); 

Row  i:  Parametrs  and  the  values  of  the  object  functions,  Fi  and  F2  after  level  (i-1),  i=2,3,4; 

(The  parameters  which  have  changed  more  during  that  level  compared  to  the  other  ones  are  high  lighted.) 


FI 


1.036E-1 

8,476E-2 

3.186E-3 

3.148E-3 


F2  (Mink) 


4.157E-2 

4.088E-2 

3.820E-2 

3.820E-2 


398 


strategy  is  applied  for  the  optimization  with  the  linear  constraints  (11a  -  He)  the  procedure  stopped  with 
the  parameter  values  given  in  Row  2,  because  the  parameter  h  has  the  dominant  component,  (dF /dh),  of  the 
derivative,  (dF/dp).  To  continue  the  optimization  procedure  further  this  dominant  component  having  the 
value  of  the  order,  1E9,  has  to  be  discarded,  that  is  ,  the  component  is  set  to  zero.  In  other  words,  at  the 
second  level,  the  optimization  procedure  is  proceeded  further  with  6  parameters.  At  the  second  level  of  the 
optimization  procedure,  changes  were  observed  in  the  values  of  the  parameters,  g  and  w  (Row  3  of  Table  1). 
It  also  observed  that  the  component,  (dF/dw),  has  a  higher  value  compared  to  the  other  components.  So  at 
the  third  level,  both  of  the  dominant  components,  (dF/dw)  and  (dF/dh)  were  discarded  by  setting  to  zero 
and  the  optimization  was  continued  with  5  parameters  and  the  optimized  values  are  listed  in  Row  4. 

Columns  7  and  8  shows  that  the  object  functions,  Fl  and  F2,  (eq,  9a)  which  correspond  to  both  the 
objectives  decrease  during  the  optimization.  At  the  third  level  it  was  observed  that  the  object  function,  F2, 
has  a  tendency  to  increase.  Therefore,  the  optimum  design  obtained  during  the  third  level  was  considered  as 
the  final  one  and  the  authors  stopped  at  the  third  level.  One  who  concerns  more  on  the  objective 
corresponding  to  the  object  function,  Fl  ,  can  give  more  weights  to  Fl  and  proceed  further  for  a  more 
accurate  optimum  design. _ 

Stator:  |i=20.0 

Stator:  |J.=20.0 

Variable  Pole  face  is  the  contour  which 
passes  through  the  points  Pi,  i=l,..,5. 


Measurement  points  pQ 

Air:  u=1.0  ^ 

po 

CoU:  J=0.8  4=1.0 

Backiron:  p 

=20.0 

Stator:  4=20.0 

Air:  4=1.0  ^ 

o 

CoU:  J=0.8  4=10 

Backiron:  4=20.0 

(a)  (b) 

Fig.  2:  a.  Parametric  description  of  a  Pole  Face  b.  Subregions  of  the  solution  region 

Design  Optimization  of  a  Magnetic  Pole  Face 

The  objective  is  to  design  a  magnetic  pole  face  with  a  constant  flux  density  of  By  =  l.OT,  in  the  air  gap. 
The  design  parameters  chosen  for  the  optimum  design  are  shown  in  Fig.  2a.  Here  the  width  of  the  air  gap  is 
wider  than  the  length,  so  n  measurement  points  are  chosen  along  the  air  gap,  (shown  by  a  dashed  line  in 
Fig.2a)  then  the  values  of  the  computed  and  desired  flux  densities  are  compared  in  the  least  squares  sense. 
The  object  function  is  defined  as  [17,  18], 


F  =  F(By(p))=^7,(B 


4i< 


’ycomputed 


where  n  is  the  number  of  measurement  points  and  i^ycQYnpnted  ^ydesired^  y-component  of  the 

computed  flux  density  and  the  desired  flux  density  at  the  i-th  measurement  point. 

The  derivative  is  given  by 


^ycomputed 


’  y  computed  J 


Here  also  the  MVP  is  chosen  as  the  state  variable  for  the  FEM  analysis  and  an  approach  similar  to  that  of 
the  first  case  study  is  applied  to  compute  the  derivative  of  the  object  function.  In  this  case  study,  to  avoid 
the  difficuties  in  modifying  the  mesh  during  the  optimization  procedure  the  following  linear  constraints  are 
imposed  on  the  parameters: 

po  =  5.0  (13a) 
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(a)  (b) 

Fig.  3:  Optimum  Designs  a.  with  out  linear  constraints  b.  with  the  linear  constraints 

pi>0.01,  i=  (13b) 

The  solution  region  is  divided  into  two  regions  (Fig.  2b),  Region  1  does  not  change  during  the 
optimization  procedure  and  can  be  considered  as  fixed.  The  nodes  of  Region  2  are  allowed  to  move 
homothetically  along  the  y-direction  [15].  This  approach  is  considered  here  to  maintain  the  mesh  topology 
and  to  apply  the  subdomain  technique  [18]  for  the  solution  of  the  algebraic  system. 

The  optimum  design  (shown  in  Fig.  3a)  obtained  with  the  maintenance  of  the  mesh  topology  and  the 
linear  constraints,  (13a)  and  (13b),  seems  to  be  not  smooth  enough  and  cannot  be  implemented  [16-18]. 
But  it  was  shown  that  the  imposing  the  following  quadratic  constraints  along  the  pole  face  smoothen  the 
surface  [17]: 


and 


r  Po  ~  Pi  _  Pi  ~  Pi+  lT 
Lxpi  -  Xpo  hj+i  J 

fpi-l  '  pi  pi  ~  pi+l~[ ' 

L  hi  “  hi+]  J 


<  e, 


i=l 


(14a) 


hj  ^i+1 

where  e  is  a  very  small  positive  value, 


xpi 


i>l  (14b) 

is  the  x-coordinate  of  the  selected  node  Pi  and  hi  is  the  distance 


between  the  x-coordinates  corresponding  to  the  selected  nodes  Pj.i  and  Pi. 
These  quadratic  constraints  are  equivalent  to  the  linear  constraints: 


and 


-  e  <  1 

r  PO  -  Pi 

Pi  “  Pi+r 

]<. 

i=l 

Lxpi  -  XpO 

hi+i  . 

-  8<  1 

fPi-l  -  Pi  ] 

L  hi 

pi  '  Pi+l~| 

hi.M  J 

<  e, 

i>  1 

(15a) 


(15b) 


The  active  set  strategy  is  - —  ^ -  —  .  .  *  i.  j  u 

optimum  design  (shown  in  Fig.  3b)  obtained  with  these  linear  constraints  is  smooth  enough  and  the 
implementation  of  the  linear  constraints  is  very  simple  in  comparison  to  the  above  mentioned  structural 
mapping  and  design  element  techniques. 


Conclusion 

An  overview  of  the  deterministic  gradient  based  optimization  was  given.  Further  difficulties  in 
implementing  the  technique  were  discussed  and  the  results  of  two  case  studies  employing  linear  constraints 


400 


were  presented.  In  the  first  case  study,  GIPM  is  successfully  applied  at  multi-levels  to  optimize  a  magnetic 
circuit  to  obtain  a  constant  flux  density  in  a  air  gap  with  a  low  leakage  inductance.  It  was  shown  that  some 
design  optimizations  require  multi-level  optimizations  to  reach  an  optimum  design,  when  some  of  the 
components  of  the  derivative,  (dF/dp),  of  the  object  function  are  dominant.  In  the  second  case  study,  some 
linear  constraints  are  efficiently  implemented  to  avoid  the  unrealistic  optimum  designs  of  the  shape 
optimization  of  a  magnetic  pole  face. 
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Abstract 

This  paper  presents  the  electromagnetic  engineering  analysis  of  an  HF  communication  antenna  system 
on  the  PC-1  Coastal  Patrol  Class  ship  using  NAVSEA  Electromagnetic  Engineering  (EMENG) 
computer  tools.  Shipboard  antenna  system  modeling  techniques  are  addressed.  The  geometry  of  the 
ship  is  represented  by  a  NEC-MoM  wire  grid  model  extracted  from  a  3-D  solid  drawing.  Geometry 
arid  segmentation  processing  checks  of  the  wire  grid  model  are  made  simple  using  the  Numerical 
Electromagnetic  Engineering  Design  System  (NEEDS),  which  color  codes  the  model  for  visual 
analysis.  Antenna  performances  are  predicted  by  the  Numerical  Electromagnetic  Code  (NEC4),  and 
the  results  are  post-processed  by  NEEDS,  which  provides  three-dimensional  plots  for  visualization. 
Electromagnetic  Compatibility  (EMC)  analysis  of  the  co-located  HF  communications  systems  is  then 
performed  by  the  Communications  Engineering  Design  System  (COEDS)  running  the  Cosite  Analysis 
Model  (COSAM 3.0). 


Introduction 

Much  of  the  communications  equipment  onboard  U.S.  Navy  ships  operates  at  HF  frequencies,  where 
wavelengths  range  from  10  m  (at  30  MHz)  to  150  m  (at  2  MHz).  Because  ships  and  objects  typically 
found  onboard  are  often  of  dimensions  comparable  to  these  wavelengths,  the  characteristics  of  the 
shipboard  HF  antennas  are  altered  by  their  physical  surroundings.  In  order  to  accurately  predict 
antenna  performance,  the  effects  of  the  entire  ship  must  be  taken  into  account.  This  goal  may  be 
achieved  through  measurements  performed  on  brass  scale  models  of  the  ships  and  their  antennas,  which 
has  been  done  for  many  years  at  NRaD.  Alternatively,  the  ship  may  be  modeled  numerically  by  a  wire 
grid,  and  calculations  substituted  for  measurements  of  antenna  parameters.  This  discussion  steps 
through  the  process  of  creating  a  wire  grid  model  of  the  PC-l  coastal  patrol  class  ship,  calculating 
antenna  parameters,  postprocessing  and  visualization  of  the  results,  and  the  use  of  these  results  to 
perform  shipboard  EMC  analysis. 


L  The  Wire  Grid  Model 

For  the  PC-1  project,  the  NEC-MoM  wire  grid  model  was  extracted  from  a  CAD-II  3-dimensional 
solid  drawing.  The  shipyard-provided  solid  drawing  is  given  in  Figure  1  below.  Using  the  Intergraph 
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Engineering  Modeling  System  (EMS)  2.2  on  the  CAD-II  workstation,  the  command  "Extract  Part  of 
Curve"  creates  a  line  element  from  a  portion  of  an  existing  curve.  This  is  the  first  project  to 
successfully  utilize  this  technique,  which  resulted  in  the  most  accurate  wire  grid  model  developed  for  a 
ship  in  numerical  modeling  history.  The  PC-1  NEC-MoM  numerical  wire  grid  model  consists  of  1820 
elements,  as  shown  in  Figure  2. 


Figure  1.  PC-1  Solid  Drawing 


Figure  2.  PC-1  NEC-MOM  Wire  Grid  Model 


n.  Model  Validation 


The  newly  extracted  wire  grid  model  must  be  validated  before  it  can  be  used  to  analyze  the  on-board 
antennas.  The  necessary  checks  are  performed  on  NRaD's  NEEDS  (Numerical  Electromagnetic 
Engineering  Design  System)  workstation,  which  was  developed  under  NAVSEA's  EM  Engineering 
program  to  assist  in  making  the  NEC-MoM  process  less  tedious  and  more  error-free.  The  geometry 
checks  include  verification  of  wire  segmentation,  radius,  and  connections.  These  are  visual  checks 
which  involve  color-coding  the  individual  segments  of  the  wire  grid  model.  The  topside  engineer  may 
thus  tell  by  a  glance  where  problems  occur  within  the  model. 

In  the  segmentation  check,  each  wire  segment  is  color  coded  by  length.  For  HF  analysis,  segment 
lengths  of  about  one  meter  are  desired  (0.1  to  0.007^  for  HF).  Color  coding  allows  for  a  quick  check 
that  most  of  the  wires  are  one  meter  long  and  that  none  are  outside  the  allowable  range.  For  the  wire 
radius  check,  almost  all  of  the  wires  forming  the  ship's  surface  will  show  a  radius  of  0.16  meter  (0.016 
to  0.001^),  which  has  been  chosen  to  produce  the  equivalent  surface  area  for  a  one-meter  wire  spacing. 
The  only  wires  which  should  be  colored  differently  are  the  antennas  and  any  other  non-surface  wires, 
for  which  the  actual  radii  are  entered.  Finally,  for  the  connections  check  the  wire  segments  are  color- 
coded  to  indicate:  (a)  both  ends  are  connected,  (b)  one  end  is  connected,  or  (c)  the  wire  is  completely 
unconnected  to  anything  else.  This  check  enables  stray  wires  to  be  picked  up  as  well  as  verification  that 
the  nodes  of  the  wire  grid  mesh  are  properly  joined.  The  color  coding  for  connection  of  a  single  end 
indicates  antennas,  flagpoles,  guns,  and  other  non-surface  objects  which  are  only  connected  to  other 
conductive  structures  on  one  end. 

If  problems  are  found  in  the  wire  grid  model,  the  wires  are  simply  modified  using  AutoCAD  and  then 
run  through  this  process  again,  until  no  further  errors  occur. 


m.  NEC  Calculations 

Once  an  error-free  wire  grid  model  has  been  created,  NEC4  is  used  to  calculate  HF  antenna 
performances.  For  the  PC-1  project,  the  two  forward  23'  HF  whip  antennas  are  designated  PI  and  SI 
for  port  and  starboard.  The  NEC-generated  results  include  coupling  between  these  antennas  (see 
Figure  3).  Additionally,  the  impedance  of  each  antenna  is  calculated  as  a  function  of  frequency  by 
exciting  the  antenna  under  study  and  solving  for  the  current  distribution.  Figure  4  shows  the  impedance 
of  antenna  PI,  which  is  almost  identical  to  the  impedance  calculated  for  SI,  since  the  ship  is 
symmetrical  along  its  lengthwise  axis.  Antenna  mismatch  loss  is  calculated  using  the  NEC  impedance 
information.  The  mismatch  loss  and  coupling  data  will  be  used  later  as  part  of  the  EMC  analysis. 
Among  the  most  interesting  results  provided  by  NEC  calculations  are  the  radiation  patterns,  which  can 
be  used  as  part  of  the  communications  system  performance  evaluation  and  link  studies. 
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Figure  3.  Antenna  Coupling 
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Figure  4.  PI  Antenna  Impedance 


IV.  Postprocessing  and  Visualization 

Postprocessing  of  the  NEC  calculations  is  performed  using  the  NEEDS  workstation  once  more.  This 
system  has  the  capability  of  providing  two-  and  three-dimensional  plots  of  the  ship  along  with  the  NEC- 
calculated  results.  This  display  is  an  important  part  of  being  able  to  interpret  the  large  amount  of  data 
generated  by  a  NEC  run.  For  example,  the  near  field  generated  by  an  onboard  transmitter  may  be 
represented  by  color-coded  dots  of  increasing  size  and  intensity  as  a  ftinction  of  field  strength,  allowing 
for  a  visual  interpretation  of  where  the  strongest  fields  occur.  NEEDS  allows  the  user  to  step  through 
the  frequency  band,  keeping  the  ship  stationary  and  changing  only  the  intensity  and  patterns  of  the  near 
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field.  Additionally,  the  display  can  be  set  to  show  only  the  fields  above  a  certain  threshold  value,  such 
as  the  level  defined  for  Hazardous  Electromagnetic  Radiation  for  Personnel  (HERP)  or  for  Ordinance 
(HERO).  A  NEEDS-generated  display  of  the  near  field  due  to  the  PI  antenna  radiating  at  400  watts 
and  14  MHz  is  shown  in  Figure  5. 


Figure  5.  Near  field  due  to  PI  radiating  at  400  W,  14  MHz 


Among  the  other  possibilities  for  3-D,  color-coded  visualization  using  the  NEEDS  workstation  are  the 
current  distributions,  charges,  near  electric  field,  near  magnetic  field,  and  radiation  patterns.  A  NEEDS 
visualization  of  the  radiation  pattern  appears  in  Figure  6.  For  this  plot,  the  relative  magnitude  of  the 
field  is  represented  by  distance  fi-om  the  origin,  and  color-coding  indicates  the  E-field  phase. 


407 


Figure  6.  Radiation  pattern  due  to  PI  radiating  at  400  W,  iO  MHz 


V.  EMC  Analysis  Using  COEDS 

The  Communications  Engineering  Design  System  (COEDS)  workstation  is  another  part  of  the 
NAVSEA  EM  engineering  (EMENG)  program.  Electromagnetic  Compatibility  (EMC)  analysis  is 
performed  to  determine  the  effects  of  transmitters  on  co-located  receive  systems.  In  COEDS,  a 
database  is  created  from  measurement  or  from  specifications  data  for  each  piece  of  equipment  in  the 
communications  subsystems.  Mathematical  models  are  built  to  describe  the  operation  of  receivers, 
transmitters,  couplers,  and  antennas.  Then  the  elements  of  different  shipboard  transmitting  and 
receiving  subsystems  may  be  configured  by  selecting  them  from  the  database  and  connecting  them  in 
the  form  of  a  block  diagram.  The  isolation  between  transmit  and  receive  antennas  calculated  earlier 
using  NEC  may  be  input  into  the  system,  as  well  as  antenna  mismatch  losses  calculated  from  the  NEC 
impedance  data.  Thus  COEDS  takes  into  account  the  effects  of  the  shipboard  environment  on  antenna 
behavior  (through  the  incorporation  of  NEC  data)  as  well  as  the  behavior  of  the  communications 
equipment. 

COEDS  may  be  used  to  compute  the  excess  interference  levels  due  to  the  following  interference  types: 
(1)  receiver  adjacent  signal,  (2)  broadband  transmitter  noise,  (3)  spurious  emissions,  (4)  spurious 
response,  (5)  receiver  intermodulation,  and  (6)  transmitter  intermodulation.  The  goal  of 
communications  system  design  is  that  receiver  performance  be  noise  limited,  i.e.,  that  the  interference 
due  to  on-board  transmitters  should  be  less  than  the  maximum  of  receiver  noise  and  ambient  noise.  In 
the  design  stage,  the  two  major  EMC  interference  interaction  parameters  which  have  a  great  impact  are 
receiver  adjacent  signal  and  transmitter  broadband  noise. 

If  the  COEDS  results  show  that  interference  levels  exceed  the  noise  levels,  several  mitigation 
techniques  are  available.  These  include  the  following: 

1.  Equipment  changes 

2.  Addition  of  filtering  on  the  receive  side 

3.  Addition  of  filtering  on  the  transmit  side 

4.  Change  of  antenna  location  to  increase  space  isolation 
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5.  Frequency  management  (increase  of  frequency  separation  between  transmitter  and  receiver) 

6.  Reduction  of  transmitter  power 

1.  Time  management  of  equipment  use 
8.  Compromise  of  design  goals 

Option  4  can  be  simulated  using  the  NEC-MoM  wire  grid  model,  while  the  effects  of  options  1-3  and  5- 
6  can  be  computed  with  COEDS.  In  a  single  COEDS  run,  several  of  these  options  may  be  combined 
at  once.  In  the  case  of  the  PC-1  EMC  analysis,  the  interactions  between  the  following  transmit  and 
receive  configurations  were  calculated  using  COEDS: 


Transmitters 

1.  Transmitting  at  400  W 

2.  Transmitting  at  200  W 

3.  Transmitting  at  100  W 

4.  Transmitting  at  50  W 

5.  Postselector  filter  added,  400  W 

6.  Postselector  filter  added,  200  W 

7.  Postselector  filter  added,  100  W 

8.  Postselector  filter  added,  50  W 


Receivers 

1 .  Baseline  receiver 

2.  One  preselector  filter  added 

3.  Two  preselector  filters  added 


COEDS  calculates  the  interference  interactions  for  every  combination  of  these  subsystems  at  a  variety 
of  user-chosen  receiver-tuned  frequencies  and  transmitter  offsets.  An  interaction  matrix  for  every 
combination  of  these  transmit  and  receive  systems  is  then  formed,  indicating  the  predicted  interference 
levels  in  each  case.  From  these  results,  an  accurate  assessment  of  shipboard  electromagnetic 
compatibility  is  accomplished. 
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ABSTRACT 

New  visualization  tools  are  being  developed  to  aid  in  interpreting  the  output  from 
computational  codes  and  measurements  used  In  the  design  and  analysis  of  the 
electromagnetic  topsides  of  Navy  ships.  One  of  the  principal  outputs  of  interest  Is  the 
radiation  pattern  of  shipboard  antennas  systems  (e.g.,  communication,  electronic 
warfare,  surveillance,  ...).  A  large  quantity  of  data  can  be  generated  by  both 
computational  electromagnetic  codes  and  measurement.  This  paper  demonstrates  the 
support  that  advanced  visualization  provides  to  the  analysis  of  shipboard  antenna 
patterns. 

1.0  INTRODUCTION 

The  objective  of  a  ship  topside  antenna  study  is  to  determine  an  antenna's 
impedance,  near  fields,  radiation  pattern,  and  coupling  to  other  antenna  elements. 
These  technical  parameters  are  necessary  to  determine  the  performance  of  the  total 
shipboard  antenna  system.  Impedance  can  be  related  to  component  compatibility. 
Near  fields  are  used  to  evaluate  radiation  hazards  to  personnel,  fuel  and  ordnance. 
Coupling  between  antennas  is  necessary  for  the  evaluation  of  the  effect  of 
electromagnetic  interference.  The  radiation  patterns  are  used  as  part  of  an  evaluation 
of  the  link  performance  of  the  individual  antenna  system. 

Both  computational  electromagnetic  codes  and  measurement  can  generate  a 
large  quantity  of  pattern  data.  Fast  and  efficient  analysis  of  this  large  amount  of  data 
requires  sophisticated  visualization  techniques.  A  user  friendly  interface  Improves 
utilization  of  the  entire  visualization  capability.  Initial  visualization  programs  were 
written  using  the  Silicon  Graphics  (SGI)  proprietary  IRIS  GL  (Graphics  Library).  The 
SGI  IRIS  Inventor,  OSF  Motif  and  MATLAB  are  now  being  used  to  provide  user  friendly 
scene  Interaction  for  a  suite  of  electromagnetic  visualization  programs.  Inventor  is  a 
C++-callable  library  of  functions  which  allows  simplified  development  of  GL-based 
scene  building  and  scene  interaction.  MATLAB  is  a  computation  environment  for 
numeric  computation,  data  analysis,  and  visualization.  Commensurate  with  the 
increasing  capability  in  visualization  tools  is  the  need  for  an  easily  accessible  user 
interface. 


410 


2.0  THREE  DIMENSIONAL  SURFACE 

Two  types  of  three  dimensional  surfaces  can  be  used  for  displaying  three 
dimensional  pattern  data.  The  radiation  pattern  is  complex  (i.e.,  magnitude  and  phase) 
data  which  is  a  function  of  theta  and  phi.  The  theta  angle  is  defined  from  the  z-axis 
toward  the  x-y  plane.  The  phi  is  defined  from  the  x-axis  in  the  x-y  plane.  The  radiation 
pattern  also  has  two  components  in  the  theta  and  phi  directions,  respectively.  For  a 
given  component  the  pattern  can  be  displayed  as  a  colorful  three  dimensional  polar 
surface.  The  distance  from  a  point  on  the  surface  to  the  origin  is  proportional  to  the  log 
of  the  magnitude  of  the  data  at  that  point.  The  color  is  coded  to  the  phase  of  the  data. 
This  creates  a  very  interesting  display  as  shown  in  figure  1.  This  display  can  be 
rendered  as  a  solid  or  as  a  wire  grid.  It  can  be  rotated  to  allow  it  to  be  viewed  from  any 
direction.  Depending  on  the  user's  preference,  there  are  several  variations  on  this 
display.  The  color  can  be  coded  to  magnitude  to  emphasize  magnitude.  Another 
display  emphasizes  the  elevation  angle  by  coding  the  color  to  elevation  angle.  A 
second  type  of  three  dimensional  display  plots  the  magnitude  as  a  surface  on  a 
rectangular  grid  of  theta  and  phi  values.  As  displayed  in  figure  2,  this  type  of  three 
dimensional  plot  helps  to  visualize  the  sidelobe  and  backlobe  structure  of  directional 
antennas.  The  principal  sidelobes  and  backlobes  can  be  emphasized  by  applying  a 
threshold  to  this  type  of  plot  and  only  displaying  magnitudes  above  the  threshold. 


Figure  1.  Three  dimensional  polar  radiation  pattern. 
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Figure  2.  Three  dimensional  surface  radiation  pattern. 

3.0  TWO  DIMENSIONAL  SURFACE 

Severs!  two  dimensional  displays  for  visualization  radar  cross  section  (RCS) 
have  been  suggested  [1].  As  a  variation  of  these  suggested  RCS  displays,  the 
radiation  pattern  could  also  be  displayed  as  a  two  dimensional  polar  surface.  The 
distance  from  the  center  is  the  theta  angle  and  the  distance  around  a  circle  at  a  given 
distance  from  the  center  is  the  phi  angle.  The  color  is  coded  to  the  magnitude  of  the 
radiation  pattern.  This  again  creates  an  interesting  display.  The  two  dimensional  polar 
display  for  figure  1  is  given  in  figure  3. 


Figure  3.  Polar  display  of  radiation  pattern. 
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4.0  THRESHOLDING 

The  three  dimensional  radiation  pattern  display  provides  wonderful  qualitative 
information  regarding  the  performance  of  an  antenna  system,  but  often  what  is  needed 
Is  quantitative  information.  As  a  result  a  three  dimensional  thresholding  technique  has 
been  developed  using  a  semitransparent  thresholding  grid.  This  allows  a  reference 
antenna  such  as  a  quarter  wave  monopole  to  be  overlaid  with  the  original  display.  The 
reference  antenna  provides  a  threshold  for  the  pattern.  The  reference  can  be  stepped 
in  1  decibel  step  sizes.  This  allows  a  quick  interpretation  of  the  antenna  performance 
for  a  given  pattern  at  a  particular  theta  and  phi  value.  An  example  of  this  display  is 
shown  in  figure  4.  The  portion  of  the  radiation  pattern  above  the  threshold  is  quite 
apparent. 


Figure  4.  Radiation  pattern  with  thresholding  grid. 


5.0  SLICING 

In  this  section  two  methods  are  described  for  presenting  a  "slice”  of  the  three 
dimensional  data  to  better  visualize  aspects  of  the  radiation  pattern.  A  visualization 
capability  was  developed  for  interactive  slicing  of  the  radiation  pattern  on  planes  with 
arbitrary  orientation.  This  would  allow  the  user  to  quickly  analyze  the  directional 
performance  of  an  antenna  system.  This  capability  would  also  facilitate  inclusion  of 
radiation  pattern  data  in  reports  and  presentations.  Using  IRIS  Inventor  on  the  Silicon 
Graphics  workstation,  a  program  was  developed  to  provide  interactive  slicing  through 
three  dimensional  radiation  pattern  data.  The  name  of  this  program  is  PatternCut. 
Data  is  displayed  on  a  polar  plot.  A  slider  bar  is  provided  to  allow  the  user  to 
interactively  slice  through  the  data.  A  version  of  the  program  exists  which  displays  the 
data  on  a  linear  graph.  This  version  is  named  LInCut.  Examples  of  the  displays 
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generated  by  PatternCut  and  LinCut  are  shown  In  figures  5  and  6,  respectively. 


Figure  5.  Example  display  generated  by  PatternCut. 


Figure  6.  Example  display  generated  by  LinCut. 

The  radiation  pattern  which  is  displayed  is  read  from  a  user-specified  file.  At  this 
time  the  program  is  designed  to  accept  the  format  for  the  file  created  by  the  Numerical 
Electromagnetic  Code  -  Basic  Scattering  Code  (NEC-BSC)  [2].  The  output  pattern  files 
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created  by  NEC-BSC  have  the  extension  .oaa.  The  first  line  of  each  file  is  a  text 
header  line.  After  that  each  row  of  the  file  contains  data  for  a  particular  theta  and  phi 
location.  Each  output  file  contains  six  columns  of  data.  These  six  columns  are  for: 
theta,  phi,  E-theta  (magnitude  in  dB),  E-theta  (phase  in  degrees),  E-phi  (magnitude  in 
dB),  E-phi  (phase  in  degrees). 

The  user  may  provide  the  name  of  the  file  containing  the  pattern  data  on  the 
command  line  when  first  starting  the  program  or  a  Motif  File  Selection  Dialog  Box  will 
appear.  When  the  program  is  initiated,  a  window  opens.  This  window  has  a  pull-down 
menu  bar  at  the  top.  Below  the  menu  bar  is  a  graphics  display  area  and  below  this 
display  area  is  a  slider  widget. 

The  menu  bar  has  two  items:  File  and  Edit.  The  sub-menu  under  File  consists 
of  Open  and  Quit.  The  hot  key  for  Open  is  Alt  +  o’.  The  hot  key  for  Quit  is  Alt  +  q'. 
Selecting  Open  causes  a  Motif  File  Selection  Dialog  Box  to  appear.  Selecting  Quit 
closes  the  program.  The  submenu  under  Edit  consists  of  three  sets  of  radio  button 
menus.  The  first  set  lets  the  user  choose  between  displaying  the  E-Theta  or  E-Phi 
component  of  the  data.  E-Theta  is  the  vertical  component  of  the  data;  E-Phi  is  the 
horizontal  component  of  the  data.  The  second  set  of  radio  buttons  allows  the  choice  of 
a  Horizontal  Cut  or  a  Vertical  Cut.  The  final  set  of  radio  buttons  allows  the  user  to 
select  between  Change  Cut  or  Animate. 

If  Horizontal  Cut  is  selected,  the  radiation  pattern  is  displayed  on  a  conical  cut. 
The  slider  bar  is  used  to  select  the  theta  value.  All  data  at  that  given  theta  value  is 
displayed.  In  the  display  window  the  angle  around  the  outside  of  the  polar  plot  is  phi. 
The  polar  plot  displays  the  data  in  decibels  (dB)  and  has  a  50  dB  dynamic  range.  If 
Vertical  Cut  is  selected,  the  data  is  displayed  on  a  vertical  cut.  The  slider  bar  is  used  to 
select  the  phi  level.  All  data  at  that  given  phi  value  is  displayed.  In  this  case  the  angle 
around  the  outside  of  the  polar  plot  is  theta. 

If  more  than  one  file  has  been  selected  (such  as  by  repeated  use  of  the 
File/Open  menu  choice),  then  the  Animate  option  allows  the  user  to  slide  through  the 
radiation  pattern  in  the  several  files.  In  this  case  the  slider  bar  is  used  to  select  the  file. 
This  is  of  use  in  cases  where  a  sequential  series  of  output  files  has  been  created.  An 
example  of  this  occurs  when  a  radar  system  is  sweeping  through  a  range  of  angles  and 
an  analysis  has  been  run  at  incremental  scan  steps.  Animating  the  data  in  this  fashion 
allows  for  analysis  of  time  dependent  effects. 

Selecting  the  radio  button  Change  Cut  returns  to  the  case  where  the  slider  bar 
changes  either  theta  or  phi  (depending  on  whether  Horizontal  Cut  or  Vertical  Cut  is 
selected). 

A  vertical  slice  as  a  range-height  radar  surface  is  presented  in  figure  7  [3].  This 
plot  emphasizes  the  significance  of  the  beam  pattern  in  the  low  elevation  angles  and 
was  formed  using  MATLAB. 
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Figure  7.  Radar  range-height  plot. 


7.0  SUMMARY 

Although  much  of  the  presented  visualization  products  were  developed  on  a 
Silicon  Graphics  workstation,  this  software  is  currently  being  converted  to  a  format 
usable  on  many  UNIX  workstations.  This  will  be  accomplished  by  converting  from 
SGl’s  proprietary  IRIS  GL  and  Inventor  to  the  multiplatform  versions  OpenGI  and  Open 
Inventory.  MATLAB  is  already  available  on  a  variety  of  platforms.  As  a  future  objective, 
the  radiation  pattern  visualization  options  will  be  consolidated  into  a  single  program  with 
a  graphical  user  interface. 

The  radiation  pattern  is  one  of  several  measures  of  antenna  performance  that  an 
engineer  uses  to  evaluate  a  given  antenna  design.  Several  different  forms  of  radiation 
pattern  visualization  have  been  presented.  A  measure  of  performance  of  a  given 
visualization  of  a  radiation  pattern  is  its  value  in  guiding  an  engineer  to  an  optimal 
design.  High  performance  visualization  enhances  an  engineer's  appreciation  for  some 
of  the  design  criteria  utilized  in  antenna  design. 
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Abstract 

The  Finite  Difference  Time  Domain  (FDTD)  method  has  been 
extensively  used  to  study  radiation  from  a  variety  of  radiating  stnic- 
t\u'es,  including  various  shaped  end  radiators  {SER)[1].  In  tliis  pa¬ 
per  we  discuss  the  application  of  FDTD  to  a  novel  liigh  gain  SER, 
wliich  efficiently  radiates  the  TMqi  mode.  This  structure  is  a  transi¬ 
tion  from  a  circ\ilai-  to  elhptical  waveguide  wliich  feeds  an  empirically 
shaped  radiating  sinface.  Included  wiU  be  descriptions  of  the  geom¬ 
etry,  techniques  for  generating  the  computational  domain,  and  direct 
comparisons  between  FDTD  calculated  results,  on  an  orthogonal  Yee 
grid,  and  the  experimentally  measured  directivity  of  this  radiator.  Ad¬ 
ditionally,  comments  will  be  made  regai’ding  the  application  of  such  a 
radiator  as  well  as  the  Tisefulness  of  FDTD  for  analyzing  such  anten¬ 
nas, 

1  Introduction 

As  iHxxls  in  th(^  RF  iiropagatioii  (toimriiinity  contiuiKi  to  grow,  so  will  the 
rccpiinniicnt  for  conqjnct,  cffici(mt  non  fundairKnital  inod<^  i  fuliating  ant(mnas. 
Many  RF  powca-  sonrc(^s  produ(:(^  (^lung)^  in  i\n)  circularly  sj^iiiiKitric  TMqi 
iriodc  within  a  cylindrical  waveguide.  The  standard  ajiiiroach  to  iruliation 
involves  iriod(i  conversion  to  th(^  fundairKUital  TEn  inodes  and  radiation  from 
a  conical  horn.  Alternativedy,  om)  may  (efficiently  radiatee  tliee  TMqi  mode  di- 
reectly  into  a  high  gain,  high  epiality  beeam  from  an  SER.  As  with  any  anteenna 
<leesigii  process,  immeerical  simulations  can  providee  imjiort-ant  information  ree- 
garding  peerformancee  character  istics.  Duee  to  thee  eeveer  increejLsing  cost  of  eexper- 
imeiital  inveestigations,  well  planeed  and  eexecuteed  computational  analysis  teeiid 
to  guidee  thee  anteenna  deesigneer  in  theeir  (efforts.  With  this  in  mind,  wee  havee 

^With  the  Department  of  Electrical  and  Computer  Engineering,  Mississippi  State  Uni¬ 
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Ixxiii  utilizing  tin;  FDTD  iruitluxl  to  study  the;  radiation  chaifudoristics  from 
an  in-lious(i  d(iV(‘lop(xi  novel  radiating  d(ivi(:(i.  In  this  pajx^r  wci  will  show  how 
FDTD  is  suit(Hl  for  iim^stigations  of  this  sort  and  provid(5  som<5  pndiininary 
results  of  th(^  simulations  eomjjfucxi  with  (^xperiiiKUit.  AdditiomUly  we  will 
ineluchi  diseiissions  of  how  tlxi  nuiiKuieal  simulations  ar<i  augmentcid  through 
the  us(!  of  symbolie  mathematieal  i>aekages  such  rus  Math(unatiea©to  both 
d(^v(do])  th(i  computation al  grid  as  W(dl  as  analyr/<i  the  data. 


2  Shaped  End  Radiator 

Th<i  SER  f(^atur(\s  an  (dliptical  waveguides  transition  from  soirxs  in])ut  eeceii- 
triesity  to  a  final  output  (sccesntricity.  Tins  (sccesntricity  is  givesn  by 


whesns  a  is  the  minor  axis  radius  and  h  is  th(s  major  axis  ralius.  Tins  out¬ 
put  transition  tlxsn  feesds  an  ai)proximately  shaped  radiating  surface.  This 
particular  SER,  as  shown  in  Figures  L  has  a  zesre)  input  esescesntriesity  (esylin- 
elrical  wavesguieles)  miel  sme)e)thly  transitie)ns  ewesr  appre)ximatesly  threse  guieles 
waveslesiigths  te)  a  =  0.9  besfe)res  sesattesring  e)nte)  a  raeliating  suifaese  that  is 
ap])re)ximat(sly  an  e)fFsest  re)tatesel  i)arabe)lie:  esllii)se)iel.  Thes  ressiilting  gese)mestry 
thesn  e)nly  bes  aehsepiatesly  elessesribesel  as  a  s])e)e)n  shapesel-enel  raeliate)r  (SSER). 

The  input  SER  wavesguieles  is  fesel  with  a  TMqi  esylinelriesal  wavesguieles  mesele. 
Fesr  esach  TM^^n  meseles  e)f  a  esyliixlriesal  wavesguieles,  thesres  exists  a  eseuresspeuieling 
esvesn  anel  e)elel  meseles  (TM^„  anel  TMJ„„  resspesestivesly)  esf  an  eslliistiesal  wavesguieles 
esxesespt  fe>r  the  TEqi  anel  TMqi  me)eless  for  whiesh  thesres  is  enily  a  eseuTesspeuieling 
esvesn  meseles.  Sinese  a  es3dinelriesal  wavesguieles  is  an  elliptiesal  wavesguieles  esf  zesre) 
escesesiitriesity,  the  input  TMcn  meseles  e:an  bes  vieswesel  as  a  TNIg^  eslliptiesal  wavesg¬ 
uieles  me)eles.  Thes  SER  funestiesns  b}’  maintaining  thes  input  TMgt  esllii)tiesal 
threuigh  thes  transitiesn  until  it  is  sesattesresel  esnto  a  raeliating  suifaese  ineluesing 
a  esurresnt  elistributiesn  that  lesaels  te)  a  highly  eliresestive,  high  ejuality  beam,  te) 
help  maintain  thes  TMqi  meseles,  thes  pressesnt  SSER  is  elessignesei  se)  as  te)  he)lei  a 
ese)nstant  me)eles  impeselaneses  e)f  296  Q  thre)ugh  thes  traiisitiejii.  The  remainelesr 
e)f  the  SSER  is  thesn  esmpiriesally  shapesel.  Thes  aestual  SSER  input  w'avesguieles 
eliamestesr  is  1.44/\  anel  the  e)utput  apesrtures  is  appresximatesly  4.75A  by  4.00A 
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at  iho  18  GHz  t(^st  fr<xiii(ni(:y.  Using  th(^  larg(^st  aix^rtun^  diirKnision  and  the 
standard  (:rit(ai{)n  for  tho  iK^rU'  to  far  f3(dd  boundary  (r  >  2D“/A),  wo  s(io 
that  the  far  fi(dd  is  any  distances  gr<^at(n-  than  32 A.  Since  th(^  wavelength  is 
0.01667  niet<Ts,  the  far  field  for  this  d(wic(i  is  any  distaiic(i  gn^aten-  that  0.5333 
HK^ters. 

The  SSER  was  testcxl  in  a  10'  by  10'  by  8'  an(x:hoic  chrunber  fully  lined 
with  12  ''  pyrainidal  absorlxu'  utilizing  an  aut()inat(Hl  sph(uical  inai)ping  <lata 
ac(iuisiti()n  system.  The  anechoic  absorber  had  a  rated  return  loss  for  normal 
inci<l(mt  energ)*  of  50  dB  from  10  to  30  GHz.  The  ant(uina's  input  was  g(uier- 
ate<l  by  an  HP  8673  synth<^siz(^d  signal  geiKU-ator,  mnplifi(xl  by  mi  HP  8349A 
power  amplifi(U'.  Just  jirior  to  th(i  SSER  input,  th<^  signal  was  sensiid  by  an 
HP  8484A  jiower  s(uisor  for  analysis  of  the  incident  wav<Torm.  The  ant<mna's 
radiated  pattiun  was  by  anotlKir  HP  84 84 A  si^iisor  attached  to  the 

input  of  a  Ku  band  pyramidal  horn  ant(uma  iiositioinxl  one  mister  from  the 
radiator.  The  s(uised  i>ow(us  wcu'e  tlnui  iiK^asunxl  by  an  HP  438 A  dual  chan¬ 
nel  pow(U'  mister.  Th(^  TMq^  mod(^  was  laiinch(^d  at  the  injiut  of  the  SER  by 
a  half  gnid(^  waviihnigth  ant(mna  located  on  axis  at  the  terminatiul  Ixiginning 
of  th(i  mod(^  launclKu-.  Tlu^  TMqi  mod(i  was  th(ui  verificxl  by  th(^  far  fi<dd 
pattern  m(^asur<un(uit  of  a  fiat  circular  cut  cylindrical  wavi^guide  atta:h(xl  to 
the  launching  se<‘tion.  MeasunurKuit  data  for  the  SSER  was  tak(Ui  ov(U'  the 
azimuthal  angh^  0  <  <?;  <  180  in  five  degr<ie  incnuiKXits  and  ov(t  the  polar 
angle  0  <  ^  <  120  in  two  degr(x^  increments. 

For  g<ui('rating  tlu^  computational  domain,  th(^  radiator  is  ])ositioned  along 
the  2:  axis  and  is  approximat(dy  d<^scrib(xi  in  r(^ctangiilar  coordinates  by 


+  h-^z) 


(2) 


wh(a-{^  th(^  major  and  minor  ax(;s  of  the  wav(^guid(!  ari^  iiolynomial  functions 
of  th{^  distanc(i  along  the  z  mxis  given  by 


a{z)  =  (iQ  +  2r  +  (loz-  +  a-iZ'^  +  a.^z^  (3) 

h(^z)  =  6o  +  2  T  T  ^3-2'^  +  h,]Z^ .  (4) 

Some  sections  of  the  wavc^guidi^  and  the  radiating  surfac(^  involv<i  an  offs(^t 
function  giviui  by  f{z)  in  Ecpiation  (2).  In  th(^  wav<^guid(\,  towards  th<^  radi¬ 
ating  suiface  feed,  the  olfsi^t  maintains  a  flat  wav(^guid(^  bottom  as  indicated 
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in  Fignn^  2  whih  th(',  oEi^ct  in  th(^  rfuliator  is  appr()xiinat(^ly  an  offset  rotatcxi 
parabola. 

A  siinpl(^  VAihic  0(^11  hkisIi  g(in<n-at()r  w^us  writtcai  using  tlu;  abov(^  (ujiiations 
as  th<i  boundfLiy.  Tli(^  roiitiiu^  was  writtcni  in  Math(iinatioa©booanso  of 
th(^  (^as(i  of  i)rograinniing  and  d(d)ngging  within  its  iiit(irpn)tiv(i  cnvironiricnt. 
Its  graphic  visnali/ation  (:ai)abilitics  also  i>rov(Ml  very  valuable  for  validat¬ 
ing  th(!  aeenra:y  of  th(^  final  g(ni(nat(xl  iiK^sh.  As  shown  in  Figiin^  3,  the 
iri(\sh  g(ni(n  ator  did  ind(M^d  produce  a  clean  computational  domain  for  FDTD 
simulation. 

3  FDTD  Analysis 

TIk)  computational  domain  <i<5scrib(xl  in  tin;  previous  se(:tion  was  used  in  all 
FDTD  simulations.  The  ant(Uina  was  (ixcit<id  by  a  half  guid<)  wavelength 
monop ol(i  radiator  posit ioiuid  oiui  c(^ll  from  tlui  closed  (uid  cap  in  the  model 
[2].  Tli<i  gap  was  then  (ixcit<nl  by  a  z  (d(^ctric  field  compoiKuit  with  a  ramped 
sinusoid  function  of  the  form 

Ei{t)  =  r(f)sin(u;ot)  (5) 

wh<u-<^  r{t)  is  the  ramping  function.  As  with  the  (^xperiiiKUital  investigation, 
the  op(U‘ating  fn^iuency  was  chos(m  to  b(^  18  GH/.  To  allow  ack^cpiate  traii- 
sients  in  the  guid(\  th<^  simulation  was  nm  for  2048  tim(ist('.ps  with  each 
timostep  being  0.863  picos(iConds.  In  the  FDTD  code,  Liao  boundary  condi¬ 
tions  (uiclos(xi  the  comi>utational  domain  with  a  si)acing  of  five  white  space 
cells  from  tlu^  mesh.  WIkui  developing  the  m<^sh.  th(i  spatial  discretization 
was  37  cells  per  fn^e  spac(^  wav(d(uigth  (or  46  c(dls  per  guides  wavehnigth). 
This  fin(^  c.(dl  size  {S  =  0.448  mm)  allowcxl  the  cuivaturci  of  th()  object  to  be 
resolv(id,  (^specially  in  the  curved  radiating  surfaces  of  th(^  antenna.  In  the 
cod(^  w(^  w(U'(^  utilizing,  the  fi(dds  an^  savc'.d  on  an  int<^gration  surfaces  after 
th(^  driving  function  reacluis  steady  stat(\  The  data  was  tlnui  post  processed 
to  calculate  th(',  far  y.oju)  radiat(xl  fi(dds.  W<^  sampk^l  data  for  -90  <  ^  <  90 
in  two  (k^gr(K5  increirnuits  and  0  <  o  <  180  in  fiv(i  (kigrc^c^  increnumts.  This 
produced  a  compkd,(i  fi(dd  map  ov(u-  th(^  (Uitini  ui)i)er  half  plains. 

Th<^  comput('.d  fi(dd  data  wer(^  post  i)rocess(xl  into  contour  field  plots 
through  the  us(^  of  th(^  softwan^  i)ackag(^  Mathematica©.  We  utilized  an 
in-house  d(iV(dop<id  jjrogram  to  produce  such  contour  maps.  This  allowed 
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ns  to  quickly  company  the  computed  results  with  those  obtained  from  the 
experimental  portion  of  th(^  study. 


4  Results 

Initial  results  indicat<i  rc^asonably  good  agr<Hmi(Uit  with  th(^  <ixp(U'im(uitfLl  data 
for  this  ant(^nna.  The,  FDTD  computcKi  r(\sults  show  slight  flutten-  in  the  far 
y.oue  contour  plots.  This  discr(q)ancy  can  be  attributcxl  to  a  number  of  possi- 
1)1(5  problems  with  th(5  simulations  and  ar(5  curn^iitly  b(5ing  invest igat (5(1.  Pos- 
sibl(5  sourc(5s  of  this  (5rr()r  include  scatt(5ring  from  cubi(5al  discr(5tizati()n  us(5d 
to  model  the  ant(5nna.  (5Sp(5cially  in  th(5  si)oon  shap(5d  radiating  surfa(5(5-  giid 
disp(5rsi()n.  and  improper  laun(5hing  of  th(5  initial  fi(5lds  at  th(5  ant(5nna's  fe(5d. 
This  (5rror  can  be  1(5(1  uc(5d  by  cr(5ating  a  fin(5r  m(5sh  at  th(5  cost  of  significantly 
in(5r(5rus(5(l  burd(5ns  on  c()mput(5r  r(5S()ur(5(5s  w(5ll  rus  (5ar(5ful  (5onsid(5ration  of 
the  f(5eding  of  the  ant(5nna.  Work  on  this  probl(5m  is  continuing. 

5  Conclusions 

Preliminaiy  data  from  the  aiit(5nna  indi(5at(5s  that  two  import, ant  factors  must 
be  consid(5red  wh(5n  using  FDTD  for  a  g(5()m(5try  of  this  sort.  First,  a  w(5ll 
constru(5t<5d  computational  domain  must  b(5  us(5d  in  all  simulations.  This 
can  pot(5ntially  reduc(5  sourc(5s  of  (5rror  in  th(5  simulations  sin^h  as  s])(5(5ular 
r(5fle(5tions  ad  grid  disi)ersion.  S(5C()ndly.  car(5ful  att(5ntion  must  be  paid  to  th(5 
launching  the  initial  mode  distiibution  insid(5  the  wav(5gui(l(5.  If  the  initial 
f(5(5d  is  i)laced  in(;oiT(5ctly  in  th(5  m()d(5l.  an  fusymmetric  fi(5l(l  distribution  will 
b(5  g(5n(5rat(5d  which  will  significantly  (5ff(5ct  th(5  far-/()n(5  radiat(5d  fi(5lds.  Work 
with  this  radiator  and  th(5  appli(5ation  of  the  FDTD  m(5thod  is  on  going  in 
an  (5ffort  to  r(5(lu(5(5  th(5  piuviously  described  (5tf(5(5ts.  Du(5  to  time  (constraints 
with  th(5  publi(5ati()n  d(5adlin(5s.  full  (5()iT(5ct(5d  simulations  hav(5  y(5t  to  be  com- 
])l(5at(5(l.  At  presentation,  corr(5ct(5d  data  for  th(5  ant(5nna  will  b(5  shown  with 
discussion  of  th(5  (5rrors  and  tlucir  r(5m(5di(5S. 
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Figure  1:  Photograph  of  SSER. 
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A  Time  Domain  Harmonic  Oscillator  Model  for  an  FDTD 
Treatment  of  Lossy  Dielectrics 

K.S.  Kunz 

Department  of  Electrical  Engineering 
Penn  State  University 


Introduction 

An  harmonic  oscillator  model  of  polarization  in  a  lossy 
dielectric  material  provides  a  useful  alternative  to  the  usual  FDTD 
treatment  of  lossy  dielectrics  best  characterized  by  (1)  where 

VxH(t)=e.eo-^^^+eo-^^|^»S(t)+o£(t)  VxE{t)=-^S^  (1),  (2) 

In  this  alternative  formulation  the  first  curl  equation  is  replaced 
by 

VxH(t)  +  ,  P(t)  =NgcX(.t)  (3),  (4) 

The  displacement  x(t)  is  found  in  the  time  domain  from  the  harmonic 
oscillator  equation 

mx  +  dx  +  kx  -  fit)  =  QoEit)  (5) 

The  formulation  then  is  entirely  in  the  time  domain  and  permits 
modeling  nonlinearities  through  the  extension  of  kx+kaX^+kaX^H- •  •  •  and 
can  include  breakdown  effects  as  well  as  material  modulation 
effects. 

Derivation  of  Harmonic  Oscillator  Constants 

Two  widely  occurring  cases  will  be  treated.  They  are  Debeye 
and  Lorentz  materials.  Debeye  materials  such  as  water  are 
characterized  by  molecules  with  permanent  dipole  moments  that  are 
randomly  oriented  until  acted  on  by  an  external  electric  field. 
Under  the  influence  of  this  external  field  they  align  to  produce  a 
net  polarization.  Lorentz  materials  have  no  permanent  dipole 
moment.  The  positive  and  negative  charges  contained  within  them 
separate  under  the  influence  of  an  external  electric  field  and 
thereby  produces  a  polarization  field. 

Debeve  Materials 

The  harmonic  equation  used  to  describe  a  Debeye  material 
incorporates  only  damping  with  a  zero  spring  constant  (k=0) .  The 
governing  differential  equation  is  then 

mx  +  dk  =  fit)  =  qoEit)  (6) 

For  impulsive  excitation 
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(7) 


fit)  =  q^Eit)  =  5(t) 

and  the  corresponding  impulsive  polarization  is 


Pj  (  C)  =  Nga  ( t) 

so  that 

(8) 

(  t)  =  Pi  (  t)  /NQq 

(9) 

The  impulsive  polarization  in  the  frequency  domain 
terms  of  the  susceptibility  x(<«>) 

is  defined  in 

Pjlw)  =eoX(«)B(co)  =eoX(w)/g-o 

where 

(10) 

E(t)  =  6(t)/go 

has  been  used  to  find 

(11) 

E(U>)  =  l/<?o 

For  Debeye  materials  x(“)  is  given  by 

(12) 

-  ('t.  ] 

(13) 

Substituting  this  expression  into  (10)  and  inverse  transforming 
yields  the  corresponding  time  domain  expression  for  the  impulsive 
polarization 

Pfit)  =eaXit)/go  (14) 

where 


X(t)  =  u(t)  (15) 

Substituting  this  expression  for  P^Ct)  into  the  equation  for  x^it) 
in  terms  of  P^Ct)  yields 

-Jfs(t)=eoX(t)/-N'‘?o=-^f-^^V’''''°  (^®) 

Ngo\  ^0  ) 
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so  that 


d  -  1/Ao  (21) 

The  constant  m  and  d  are  now  determined  and  72ix  +  dx  =  QQEit) 
becomes 


(^)  "•* 


x)  =  qE{t) 


(tnX  -  x)  =  E{t) 


Lorentz  Materials 

The  same  derivation  holds  for  Xj(t),  namely  that  it  is  given  by 

=  €oX  ( t) (24) 
The  susceptibility  of  a  Lorentz  material  in  the  frequency  domain  is 
X(ci))  =  (e^  -  ej /(coo  +  2j<i)x  ~  <i)2)  (25) 

with  the  corresponding  time  domain  expression 

X  ( t)  ={[(eg-ej  o>o/  (coo-a^)  (26) 

Thus 

x^  ( t)  ^o/  (wo-a^)  ^^^]e"**sin(Qo”a^)  (27) 

For  the  harmonic  oscillator  equation  to  return  this  solution  for 
impulsive  excitation  the  constants  jn,  d  and  k  must  be  suitably 
adjusted.  Following  the  same  lines  as  for  the  Debeye  material  the 
constants  m,  d  and  k  are  determined  to  be 

277  - ^2. — -  ,  cf  =  2ina  ,  k  =  mcDo  (28) 

(Gg-e„)  <i)o 

The  harmonic  oscillator  equation  mx  +  dx  +  Jcx  =  q^Ei  t)  becomes 

NQa  - - - r  [*  +  2a^c  +  Q?x]  =  (29) 

(e^-ej  <i)o 

Scattered  Field  Implementation 

A  scattered  field  implementation  is  readily  effected  by 
differencing  the  curl  equations  as  in  equation  (2)  except  that  for 
the  curl  of  H  total  equation  (3)  is  employed  so  that 

^  At  ^  A  t 

-  ’  =  Co  (31) 

At 


Vxif^-"*i=eo  +  O 


This  resulting  curl  of  H  scattered  equation  must  be  further 
expanded.  In  particular  the  harmonic  oscillator  equation  can  be 
solved  for  x"'^^  so  that 
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(33) 


m 


■  (2X''-X”-^)  +^x”4g.  (SS.n*l+Bi-n*l) 
i  At 


,  (A£)2 

For  notational  compactness  define 

Y='  ^ 


^  -+^+icl 


(At)2  At 


(34) 


4^  4JC 

L  (At)2  At  J 

Substituting  the  expression  for  x"*^  into  the  curl  of  H  scattered 


equation  yields 


m 

^0 

(2X”-X'’*^)  4— ^X”4^  (£S,n*l 

yAt  Y 


u 

This  can  be  solved  for  to  complete  the  algorithm  with 


(/^o-q/y) 


(  2m  d 

11  (At)^  At  T 

(At)2  J 

}/  (€o+WqoVY+cyA  t) 


(35) 


(36) 


Results 

The  completed  harmonic  oscillator  model  of  lossy  dielectrics 
is  not  unlike  the  frequency  independent  implementation  and  only 
requires  one  additional  add  and  multiply.  It  therefore  runs  in  a 
nearly  comparable  time.  Memory  requirements  are  increased  by  as 
much  as  a  factor  of  two  by  the  need  for  storing  the  one  and  two 
time  step  prior  displacements  in  the  x,y  and  z  directions  for  the 
three  E  field  components  of  each  cell. 

Figure  1  is  a  one  dimensional  calculation  of  scattering  from 
a  half  space  of  water,  a  Debeye  material,  that  replicates  the 
frequency  dependent  lossy  dielectric  model  behavior  of  (2)  in 
figure  8.5  demonstrating  the  harmonic  oscillator  model  »s  capability 
to  replicate  the  frequency  dependent  lossy  dielectric  model *s 
prediction.  The  water  air  interface  is  at  the  500^^  cell  and  the 
gaussian  probe  has  already  encountered  the  interface  and  been 
mostly  reflected.  The  calculation  was  made  with  a  three 
dimensional  algorithm  modified  for  one  dimensional  geometries. 


(1)  R.  Luebbers,  D.  Steich  and  K.  Kunz,  ”FDTD  Calculation  of 
Scattering  from  Frequency-Dependent  Materials,”  IEEE 
Transactions  on  Ant.  and  Prop.,  Sept.  1993. 

(2)  The  Finite  Difference  Time  Domain  Method  for  Electromagnetics, 
K.S.  Kunz  and  R.J.  Luebbers,  CRC  Press  1993. 
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FDTD  Modeling  of  Electromagnetic  Wave  Interactions 
with  Composite  Random  Sheets 

James  G.  Maloney  and  Brian.  L.  Shirley 

Signature  Technology  Lab,  Georgia  Tech  Research  Institute 
Georgia  Institute  of  Technology 
Atlanta,  Georgia  30332 

VVe  will  present  a  new  approach  to  computing  the  interaction  of  electromagnetic  plane  waves 
with  composite  sheets  (conductor/dielectric  mixtures).  The  approach  is  based  on  using  the  finite- 
difference  time-domain  (FDTD)  method  to  solve  Maxwell’s  equations  subject  to  the  relevant 
boundary  conditions  imposed  by  the  composite  sheets.  Specifically,  the  cases  considered  were 
thin  sheets  (thickness  <  wavelength)  composed  of  a  random  distribution  of:  parallel  bars  (ID 
randomness),  patches  (2D  randomness),  and  small  cubes  (3D  randomness).  The  constituents 
(bars,  patches,  cubes)  were  either  dielectric  or  conductor  (both  finite  and  infinite  conductivity 
were  considered).  The  relative  fraction  of  constituents  was  varied  from  below,  to  near,  to  above  the 
percolation  threshold.  The  quantity  of  interest  for  this  research  was  the  frequency  dependence  of 
the  transmission  and  reflection  coefficients  as  a  function  of  fill  fraction  and  degree  of  randomness. 
During  the  presentation,  visualizations  (movies)  of  the  electromagnetic  waves  interacting  with 
composite  sheets  will  be  shown  to  illustrate  the  different  mechanisms  involved  for  the  different 
degrees  of  randomness.  Comparison  of  numerical  predictions  with  experimental  measurements  for 
the  high  conductivity  cases  will  also  be  presented. 
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An  Improved  Near  to  Far  Field  FDTD  Algorithm 
K . S .  Kunz 

Penn  State  University 
Department  of  Electrical  Engineering 

Introduction 

A  commonly  employed  near  to  far  field  algorithm  (1)  can  be 
derived  from  the  expression  (eg.  4.22,  Stratton  (2)) 

(i?  X  if)  4)  +  (n  X  £■)  X  V<t)  +  (n  •  ^)  V<|>)  da 

s 

where  0  =  e^^'Vr,  by  replacing  iw  by  5/5t  and  ik  by  d/3r.  The 
result  is 

where 

N(.t)  =  — fjs(.t+  {f  •  i) /c  -  R/c)  ds'\  ,  =  h  X  H(t) 

4nRc  at]  J,  I 


U{t)  = — i — -J-f  f  M.(t+  [2'-£) /c  -  R/c)ds'  \  ,  M^{t)  =  -h  X  Bit) 
4ni?c  9ti  J,  I 


A  worthwhile  improvement  to  this  approach  is  to  turn  to 
equation  4.26  of  Stratton 

V  X  E{x^,y^,  z^)  =  f  (n  X  ff(x,y,  z) )  x  V<J>  da 

47U  j 

This  is  derived  from  (1)  straightforwardly  by  noting  the  V'  x  is 
in  the  primed  coordinates.  Now 

V' X  ^;(x^y^  ^0  =  ico^iif'(x^y',  zO  (2) 


if{x^y^z0  ^  — i-  f  (n  X  if(x,y,  z) )  x  V<t>  da 
J 


Equation  (3)  can  now  be  employed  to  formulate  a  near  to  far 
algorithm,  instead  of  (1).  The  advantage  is  treating  only  one 
field  quantity,  a  gain  in  simplicity  and  computational  cost. 
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Time  Domain  Evaluation  of  the  Far  Field  Expression 


Starting  with  (3) 

where 

V4..(i  -ik)  f. 

(4) 

and  in  the  limit  r-^o  so  only  the  -ik  term  survives,  the 
expression  for  H(x’,  y*,  z*)  becomes  the  far  field  expression 

_  Jj£n  f  [{nxH{x,y,  z))  xf^]  da 

4  J  r 

3 

(5) 

Now  setting  k=(i>/c 

z')  = 

=  j  [  (nxH(x,y,  z)  xfp]  e  da 

(6) 

Fourier  transforming  this  expression  to  the  time  domain, 

noting 

that  iu  becomes  a/at  and  is  a  time  retardation  which  can  be 

ignored,  yields  the  time  domain  expression  for  the  far  field 
response 

t)  =— ^  -^flnxH{x,y,z,t)xf„]  da  (7) 

47tcr  otJ 

s 

In  evaluating  this  expression  r  is  normalized  to  unity  and  is 
the  unit  vector  pointing  from  anywhere  on  the  surface  to  the 
observation  point. 


Far  Field  Implementation 

A  rectangular  problem  space  is  assumed  with  a  "white  space" 
four  cells  wide  or  more.  This  allows  an  interior  surface  to  be 
defined  that  is  one  and  a  half  cells  in  from  the  faces  of  the 
problem  space  and  free  of  the  scatterer.  This  inner  boundary 
surface  cuts  through  two  H  field  components  that  are  tangential 
to  the  inner  boundary.  The  surface  integral  equation,  equation 
(7) ,  can  be  evaluated  using  these  components  at  a  selected  number 
of  observation  points,  each  defined  by  observation  angles 

ande^^^(n)  .  A  relative  delay  time  is  associated  with  the 
contributions  from  each  field  component.  It  is  derived  from  the 
offset  of  the  field  point  compared  to  the  problem  space  center 
along  the  direction  of  observation.  The  contribution  of  each 
field  point  is  weighted  by  the  area  of  a  cell  face  centered  about 
the  component.  One  exception  is  the  components  along  the  edges 
of  the  inner  boundary.  They  are  weighted  by  a  half  cell  face 
area . 

The  relative  delay  is  in  general  not  an  integral  multiple  of 
the  time  step  At.  Therefore  the  delayed  contribution  is  divided 
into  two  contributions  to  the  two  closest  time  intervals  given  by 
integral  multiple  of  At  using  a  simple  linear  interpolation 
scheme.  Each  H  field  component  on  a  face  can  contribute  to  two 
field  components  at  the  observation  location.  At  each  time  step 
the  contribution  of  a  single  component  on  the  inner  boundary  to  a 
single  component  at  the  observation  location  can  occur  at  two 
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adjoining  time  intervals.  A  running  accumulation  in  time  of 
these  contributions  from  all  inner  boundary  components  yields  an 
accurate  time  history  of  the  total  scattered  response  at  the 
observation  point. 


Results 

Figures  1-3  are  the  three  far  field  H  field  components 
(x,y,z  respectively)  for  a  square  metal  plate  17  x  17  cells  in 
size  illuminated  by  a  256  time  step  wide  Gaussian  pulse.  The 
response  is  essentially  the  same  as  that  seen  in  reference  1 
figure  7.2  for  a  similar  geometry  and  angle  of  incidence  (0  = 
45°,  (p  =  3  0°)  for  the  backscattered  radiation. 


1)  K.S.  Kunz  and  R.J.  Luebbers,  The  Finite  Difference  Time 
Domain  Method  for  Electromagnetics.  CRC  Press,  1993. 

2)  J.A.  Stratton,  Electromagnetic  Theory.  McGraw  Hill,  1941. 
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Abstract 

The  finite-volume  hybrid-grid  (FVHG)  algorithm  com¬ 
bines  unstructured  finite-volume  grids  with  traditional 
finite-difference  time-domain  (FDTD),  The  grids  are 
completely  coupled  without  the  need  for  spatial  mterpola- 
tion.  The  finite-volume  grid  is  defined  to  terminate  on  a 
rectangular  surface  that  may  be  located  very  close  to  the 
geometry  under  study,  which  minimizes  overhead.  Ab¬ 
sorbing  boundary  conditions  with  >40  dB  attenuation  are 
easily  applied  in  the  bounding  FDTD  grid.  In  this  paper, 
the  FVHG  algorithm  is  extended  to  fi-ee-meshed,  tetrahe¬ 
dral-element  imstructured  grids.  This  provides  finite- 
element-like  modeling  flexibility  with  explicit  time  differ¬ 
encing.  Multi-material  regions  can  be  solved.  The  FVHG 
algorithm  is  efficient  and  accurate,  even  with  complex 
free  meshes  generated  by  advanced  solid-modeling  soft¬ 
ware.  Long-term  stabilization,  material  averaging,  and 
complications  with  tetrahedral  grids  are  discussed. 

1.0  Introduction 

Traditional  finite-difference  time-domain  (FDTD)  codes 
have  been  used  extensively  over  the  past  ten  to  twenty 
years  for  the  electromagnetic  (EM)  analysis  of  geometries 
with  low  to  moderate  complexity.  These  codes  are  based 
on  structured  cubical-hexahedral  cells  that  model  geome¬ 
tries  with  a  staircase  approximation.  The  advancement 
technique  uses  the  Yee  algorithm  that  can  be  written  in 
terms  of  simple  contour  integrations  [1].  Much  research 
has  been  invested  in  the  development  of  grid  truncation 
techniques  [2,3],  near-to-far-field  transformations  [4],  and 
sub-grid  algorithms  [5,6].  For  geometries  that  conform  to 
a  rectangular  grid,  FDTD  is  efficient  and  accurate  using 
10-20  cells  per  wavelength.  For  arbitrary  geometries, 
traditional  FDTD  can  become  impractical  because  hun¬ 
dreds  of  cells  per  wavelength  may  be  required  to  ade¬ 
quately  approximate  curvature,  with  the  required  resolu¬ 
tion  being  dependent  on  incident  angle  and  polarization 
[7,8]. 

Although  FDTD  will  remain  popular  for  many  years, 
there  is  considerable  interest  in  the  development  of  accu¬ 


rate  and  efficient  body-conforming  volumetric  algorithms. 
Transient  solvers  based  on  computational  fluid  dynamics 
(CFD)  techniques  [9],  finite- volumes  [10,11,12],  local 
contour  modifications  [13],  and  overlapping  non- 
rectangular  and  rectangular  grids  [14]  are  tmder  develop¬ 
ment.  Finite-element  approaches,  although  primarily  used 
in  the  frequency  domain,  are  receiving  attention  in  the 
time  domain  [15].  The  accuracy  and  efficiency  associated 
with  various  non-orthogonal  cell  shapes  is  of  considerable 
importance  with  all  methods,  and  late-time  stability  be¬ 
comes  an  additional  concern  with  transient  approaches. 
Some  algorithms,  such  as  the  CFD-based  approaches, 
advance  both  the  electric  and  magnetic  fields  on  a  single 
grid,  whereas  the  approaches  described  in  [10-14]  use  two 
offset  grids.  The  latter  techniques  reduce  identically  to 
the  FDTD  Yee  algorithm  when  the  grid  is  rectangular  and 
can  be  referred  to  as  generalized- Yee  algorithms. 

Much  of  the  previous  work  on  transient  non-orthogonal 
algorithms  is  based  on  grids  generated  by  mapped- 
meshing  techniques.  Such  grids  are  generally  composed 
of  hexahedral-  and/or  wedge-shaped  cells  and  are  block- 
structured,  although  the  grids  can  be  referenced  using 
unstructured  techniques.  Mapped-meshmg  usually  re¬ 
quires  partitioning  ftie  geometry  to  construct  the  mesh  and 
is  therefore  not  as  flexible  as  free-meshing  algorithms. 
Free  meshing  generally  gives  rise  to  fully  unstructured 
tetrahedron  grids,  although  free-meshing  hexahedral  al¬ 
gorithms  are  presently  being  developed  with  the  CUBIT 
project  at  Sandia  National  Laboratories  [16].  Mapped- 
mesh  grids  provide  a  more  visually  satisfying  mesh  than 
grids  obtained  from  free-meshing,  but  the  overall  quality 
of  the  grid  with  regard  to  cell  stretch  and  distortion  is  of¬ 
ten  superior  with  tetrahedral  elements. 

The  present  paper  emphasizes  free-meshing  with  linear 
tetrahedrons  when  used  in  conjunction  with  the  finite- 
volume  hybrid-grid  (FVHG)  technique  [12].  For  open- 
region  problems,  the  FVHG  algorithm  embeds  the  un¬ 
structured  grid  in  a  traditional  FDTD  code.  Either  an 
analytic  or  a  Berenger-type  absorbing  boimdary  condition 
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is  easily  applied  in  the  structured  grid  without  incurring 
considerable  overhead.  The  unstructured  region  is  de¬ 
signed  to  terminate  so  that  a  direct  connection  to  the 
structured  grid  occurs  without  grid  interpolation.  The 
overlapping  grid  algorithm  [14]  requires  spatial  interpola¬ 
tion  between  grids.  Many  complications  have  been  expe¬ 
rienced  with  free-meshed  tetrahedrons  that  did  not  occur 
with  mapped  meshes,  and  these  will  be  discussed 
throughout  the  paper.  The  notation  in  this  paper  differs 
somewhat  from  that  used  in  [12]  to  simplify  the  generali¬ 
zation  presented  here. 

Unstructured  fmite-volume  algorithms  demand  consid¬ 
erably  more  computer  resources  than  FDTD.  Thus,  the 
ideal  finite-volume  code  provides  complete  solid¬ 
modeling  flexibility  while  minimizing  overhead.  Indeed, 
one  of  the  most  powerful  aspects  of  volumetric  solvers 
lies  in  their  ability  to  accommodate  the  inhomogeneous 
materials  that  can  be  easily  generated  using  advanced 
solid-modeling  programs.  Key  to  the  FVHG  algorithm  is 
a  transition  to  the  structured,  external  FDTD  grid  as 
quickly  as  possible.  As  described  below,  this  transition 
has  been  accomplished  within  a  minimum  of  1-2  FDTD 
spatial  steps  from  solid  models  generated  using  commer¬ 
cial  products.  This  distance  is  remarkably  close  to  the 
solid  model  and  has  lead  to  highly  resolved  grids  without 
an  excessive  number  of  non-orthogonal  cells.  For  exam¬ 
ple,  the  NASA  9.936-inch  almond  was  modeled  to  3.5 
GHz  at  13  “cells”  per  wavelength  using  65  thousand  tet¬ 
rahedral  elements,  of  which  15  thousand  elements  filled 
the  almond’s  interior.  This  particular  grid  could  accom¬ 
modate  different  materials  within  the  almond. 

Of  course,  an  alternative  approach  is  not  to  transition  to  a 
structured  grid  at  all  and  instead  terminate  the  grid  using 
simple  impedance-matched  absorbing  layers  [17,18]  filled 
with  unstructured,  non-orthogonal  cells.  Although  satis¬ 
factory  damping  of  the  transient  response  can  be  obtained 
and  parallelization  of  the  field  advancement  occurs,  many 
(>  10)  layers  are  often  required.  Because  the  number  of 
non-orthogonal  cells  required  to  fill  these  layers  can  be¬ 
come  extremely  large  the  hybrid-grid  approach  has  been 
found  to  be  preferable,  at  least  when  using  free-meshed 
tetrahedral  elements  in  the  non-orthogonal  region.  An 
area  of  future  research  that  may  show  promise  is  the  ex¬ 
tension  of  a  Berenger-type  perfectly  matched  layer  to 
non-orthogonal  cells,  but  only  if  it  can  be  applied  such 
that  it  is  nearly  adjacent  to  the  solid  model  and  provides 
>40  dB  damping  with  only  a  few  layers  of  limited  thick¬ 
ness. 

The  scope  of  the  paper  is  as  follows.  In  Section  2,  grid- 
generation  programs  applicable  to  the  FVHG  algorithm 


are  discussed.  In  Section  3,  a  technique  to  interface  free- 
meshed  unstructured  grids  to  the  structured  FDTD  mesh  is 
presented.  In  Section  4,  the  mathematical  foundation  of 
the  FVHG  algorithm  is  developed  and  results  are  pre¬ 
sented.  Conclusions  are  given  in  Section  5. 

2.0  Grid  Generation 

Many  commercial  and  non-commercial  programs  exist  to 
generate  either  mapped  or  free  meshes.  The  FVHG  al¬ 
gorithm  requires  that  the  non-orthogonal  region  termi¬ 
nates  on  a  rectangular  parallelepiped  such  that  structured, 
cubical  hexahedrons  can  be  directly  interfaced.  Mapped- 
mesh  programs  are  well  suited  to  this  form  of  termination. 
INGRID  [19],  CUBIT  [16],  and  I-DEAS  [20]  are  three 
examples  that  have  been  used  successfully.  A  fourth  ex¬ 
ample  that  shows  promise  for  large-scale  mapped-mesh 
grid  generation  is  based  on  the  parallel  mesh  project  at 
Lawrence  Livermore  National  Laboratory.  PMESH  [21] 
possesses  an  ACIS  solid-modeling  kernel  and  is  capable 
of  generating  tens  of  millions  of  cells. 

Programs  that  are  capable  of  generating  free  tetrahedral 
elements  that  terminate  with  the  required  properties  for 
use  with  the  FVHG  algorithm  are  much  more  difficult  to 
obtain.  Two  commercial  products  have  been  evaluated: 
(1)  PRO/Engineer  [22],  and  (2)  I-DEAS  [20]. 
PRO/Engineer  permits  the  user  to  terminate  the  grid  on  a 
parallelepiped  with  specified  fixed-node  locations.  It  does 
not,  however,  generally  adhere  to  specified  maximum  and 
minimum  edge  lengths  on  the  terminating  surface.  As  a 
consequence,  additional  nodes  may  be  placed  on  the 
parallelepiped  and/or  the  tetrahedral-element  bases  may 
not  possess  a  required  isosceles  form.  Either  of  these 
situations  fail  to  establish  the  interface  of  the  surrounding 
structured  grid. 

I-DEAS  permits  the  surface  bounding  the  unstructured 
grid  to  be  partitioned  into  subsurfaces,  which  obviates  the 
meshing  problems  experienced  with  PRO/Engineer.  In 
addition,  the  meshing  module  of  I-DEAS  has  been  found 
to  be  much  more  flexible,  particularly  with  regard  to  us¬ 
ing  symmetry  and  reflecting  elements.  I-DEAS  was  used 
for  all  free-meshed  FVHG  results  in  this  paper.  Because 
the  FDTD  structured  grid  can  be  interfaced  extremely 
close  to  the  free-meshed  grid,  I-DEAS  shows  promise  for 
the  transient  FVHG  analysis  of  complex  problems  ranging 
from  circuit  boards  to  biomedical  applications.  However, 
I-DEAS  currently  does  not  provide  a  parallel-processing 
capability,  and  thus  grids  are  limited  to  a  practical  number 
of  1-2  million  cells.  It  is  noted  that  CUBIT’S  3D  plaster¬ 
ing  algorithm  for  the  generation  of  free-meshed  hexahe- 
dral  elements  may  be  of  future  interest  when  the  algo¬ 
rithm  matures  to  accommodate  complex  geometries. 
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An  I-DEAS  tetrahe¬ 
dron  grid  of  a  sphere 
embedded  in  a 
closely  spaced  cubi¬ 
cal  box  with  the 
proper  termination 
characteristics  is 
shown  in  Fig.  2.1. 


3.0  Unstructured/Structured  Mesh  Interface 

A  hexahedral  element  interfaced  to  the  bases  of  two  tetra¬ 
hedral  cells  is  shown  in  Fig.  3.1.  For  use  with  the  FVHG 
algorithm,  one  face  of  this  hexahedron  is  split  into  two 
isosceles  triangles;  this  element  is  treated  as  a  seven-faced 
element.  A  layer  of  these  special  hexahedral  elements  is 
wrapped  completely  around  the  unstructured  grid.  This 
layer  is  referred  to  as  the  interface  region.  The  tetrahe¬ 
dron  grid  terminates  on  the  inner  boundary  of  this  region 
while  the  FDTD  grid  connects  to  the  outer  boundary. 
Standard  cubical  FDTD  cells  interface  directly  to  the 
special  hexahedral  elements  without  spatial  interpolation. 


Inner  Boundary  of 
Interface  Region 


The  field  advancement  scheme  is  as  follows.  The  electric 
(E)  fields  in  the  structured  grid  are  initially  advanced. 
The  FDTD  E-fields  on  the  outer  boundary  of  the  interface 
region  are  second- order  time  interpolated  and  mapped 
into  the  unstructured-grid  registers.  The  unstructured 
region  is  advanced  (possibly)  several  sub-time  steps  as 
dictated  by  the  Courant  condition  in  this  region.  The  un¬ 
structured  E-fields  are  subsequently  mapped  into  the 
FDTD  registers  on  the  inner  boundary  of  the  interface 


region.  Advancing  the  FDTD  magnetic  (H)  fields  in  the 
structured  grid  completes  the  overall  time  step. 

The  scheme  provides  a  seamless  spatial  connection  be¬ 
tween  the  unstructured  and  structured  regions.  The  time- 
interpolation  could  be  eliminated  by  using  the  same  At  for 
both  grids;  however,  the  interpolation  can  reduce  the 
overall  execution  time  without  introducing  lower-order 
error. 

4.0  FVHG  Algorithm 

The  non-orthogonal  region  is  defined  in  terms  of  a  pri¬ 
mary  grid  and  a  dual  (or  derived)  grid.  The  basic  struc¬ 
ture  of  the  primary  grid  is  generated  from  the  solid¬ 
modeling/grid-generation  program;  a  preprocessor  forms 
the  dual  grid  along  with  all  necessary  cell  information  for 
both  grids.  The  centroids  of  the  primary-grid  cells  corre¬ 
spond  to  the  vertices  of  the  dual-grid  cells.  Vector  H- 
fields  are  defined  at  primary-cell  centroids,  whereas  vec¬ 
tor  E-fields  are  defined  at  primary-cell  vertices.  The  ver¬ 
tices  of  the  primary  cells  are  contained  within  the  dual 
cells,  but  they  do  not,  in  general,  fall  precisely  at  the  loca¬ 
tion  of  the  dual-cell  centroid.  Because  the  vector  H-fields 
are  advanced  using  primary  cells,  the  primary  grid  is 
equivalently  labeled  an  H-grid.  Similarly,  the  dual  grid  is 
equivalently  labeled  an  E-grid  because  vector  E-fields  are 
advanced  using  the  dual  cells.  The  algorithm  accommo¬ 
dates  primary  cells  that  are  arbitrary  convex  polyhedrons. 

For  mapped-mesh  grids  composed  mainly  of  hexahedral 
elements,  the  number  of  cells  on  the  primary  and  dual 
grids  is  roughly  equal.  For  free-meshed  tetrahedron  grids, 
frie  number  of  dual  cells  has  been  found  to  be  roughly 
one-fourth  to  one-sixth  the  number  of  primary  cells,  with 
each  cell  containing  many  nodes,  faces,  and  edges,  but 
this  can  vary  considerably  depending  on  the  form  of  the 
primary  grid.  The  distribution  of  dual  cells  is  output  from 
the  preprocessor. 

A  typical  H-grid  tetrahedral  element,  referenced  as  Hj,  is 
shown  in  Fig.  4.1.  Dual  cells  (not  shown)  surround  the 
four  vertices;  these  are  referred  to  as  =  1-4.  The  dual 

edge,  s/ ,  connects  the  centroids  of  primary-cell  i  with  a 
neighboring  cell,  referenced  as  primary-cell  k.  The  pri¬ 
mary  edge,  s  *  ,  connects  E ;  and  Ej.  The  volume  of  pri¬ 
mary-cell,  /,  is  denoted  V-^,  while  the  volume  of  dual¬ 
cell,;,  is  denoted  V] .  The  area  of  face,  /,  on  primary- 
cell,  /,  is  denoted  by  while  the  (average)  area  of  face, 
p,  on  dual-cell,;,  is  denoted  Ajp.  nf/  and  denote 
average  outward  normals  for  each  face  of  the  primary  and 
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dual  cells,  respectively.  (Average  face  areas  and  normals 
are  used  because  the  (dual-cell)  faces  may  be  non-planar.) 


By  integrating  over  each  cell  on  the  primary  and  dual 
grids,  the  fmite-volume  time-domain  (FVTD)  form  of  the 
scattered-field  equations  is 


Fig.  4.1.  Cell  definitions. 

In  total  and  scattered  forms,  Maxwell’s  equations  are: 
Total: 

,1  — H* +a„H*  =  -VxE*  (4.1a) 

dt 

£  — E*  +cr.E‘=  VxH*  (4.1b) 

dt 

Scattered: 

5  ^  c 

dt 

-Vxe'-(h-hJ  — (4.2a) 
dt 

=  VxH*  -(e-s  )-^E‘-agE'  (4.2b) 

dt  dt 

Here,  e,crg,  p,  cj^,  denote  permittivity,  electric  conduc¬ 
tivity,  magnetic  permeability,  and  magnetic  conductivity, 
respectively.  Total  fields  have  a  superscript  t,  while  scat¬ 
tered  and  incident  fields  have  superscripts  s  and  i,  respec¬ 
tively.  The  scattered  field  equations  (4.2a,  4.2b)  require 
linearity.  Although  either  formulation  can  be  used,  the 
scattered-field  equations  have  been  found  to  provide  in¬ 
creased  accuracy  and  dynamic  range  with  open-region 
problems  by  eliminating  grid  reflection  of  the  incident 
field  at  the  interface  between  unstructured  tetrahedrons 
and  the  structured  FDTD  grid.  In  the  following,  only  the 
scattered-field  form  of  the  equations  will  be  presented, 
with  the  total-field  results  being  similar. 


Because  the  basic  structure  of  the  primary  grid  is  output 
from  the  solid-modeling  program,  all  materials  are  de¬ 
fined  relative  to  primary  cells.  For  magnetic  materials, 
this  is  the  natural  definition.  However,  electric  materials 
are  associated  with  the  spatially  staggered  dual  cells  and 
therefore  the  permittivity  and  electric  conductivity  require 
averaging.  The  average  values  are  denoted  by  zj  and 
Q-g  respectively  (more  will  said  about  material  averag¬ 
ing  techniques  in  Sections  4.5,  4.6). 

4.1  Area-Integral  Approximation 
There  are  at  least  two  possibilities  for  the  evaluation  of 
the  area  integrals  in  Eqs.  4.3,  which  is  why  the  symbol, 
was  used  over  the  area-integral  vector  fields:  (1)  For  each 
face,  the  vector  fields  at  all  of  the  vertices  are  first  (linear 
or  volumetric)  averaged  and  then  vectorially  crossed  with 
the  average  face  normal.  This  result  is  finally  multiplied 
by  the  average  face  area  to  complete  the  calculation  for 
that  face  (note  that  edge  fields  are  not  required).  (2)  The 
vector  fields  at  two  vertices  are  initially  used  to  construct 
an  approximation  for  the  field  along  the  connecting  edge. 
The  cross  product  is  then  expanded  as  a  vector  triple 
product  at  each  vertex,  and  finally  an  average  value  for 
the  area  integral  is  constructed  by  summing  over  all  verti¬ 
ces  of  the  face.  Although  the  evaluation  described  by  the 
first  scheme  is  simpler  to  implement,  it  has  been  found  to 
lead  to  degraded  accuracy  and  increased  late-time  insta¬ 
bility  for  all  but  the  simplest  of  cell  shapes  (it  is  recom¬ 
mended  only  for  mapped-mesh  grids  with  limited  cell 
distortion).  The  second  scheme  has  been  found  to  be  gen¬ 
erally  more  accurate  and  robust,  but  it  also  has  been  found 
to  suffer  similar  difficulties  for  complex  cell  shapes. 
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4.2  Volume-Integral  Approximation 
The  volume  integrals  in  Eqs.  (4,3)  are  easily  evaluated  by 
assuming  the  vector  field  remains  constant  throughout  the 
cell  volume.  This  is  the  approach  used  throughout  the 
paper  and  leads  to  the  following  equations: 


JJh*  •!.?<«?,  (4.5a) 


-Al.  ffn?;  xE'  dA, -{iii  -hJ-hI-  -  o„,.  H;  (4.4a) 

v!'  I  dt 


T,  <P  - 

^hP  Up 

(e/  -  )MjE'  .  dAl^  -  S {{e'  •  (4.5b) 


e;~E,+a^;E  /  = 


-ll  Jjn5-pXH*^p-(sy-Ej^E'.-a,^.E’  (4.4b) 


A  first-order  approximation  to  the  vector  fields  within 
ceils  can  be  constructed  by  expanding  the  fields  in  a  first- 
order  Taylor  series  and  applying  the  relationship 
JIJvvj/  dV  -  jjvj;  ndA,  where  i}/  denotes  a  scalar  field 
and  n  denotes  an  outward  normal.  This  relationship  en¬ 
ables  the  field  derivatives  to  be  easily  evaluated.  The 
expansion  has  been  implemented  on  a  variety  of  cell  types 
and  has  been  found  to  have  a  minor  effect  on  the  accuracy 
of  the  geometries  examined.  The  improvement  has  not 
been  found  to  outweigh  the  added  implementation  com¬ 
plexity,  but  an  exhaustive  study  has  not  been  done.  It  is 
noted  that  the  expansion  may  be  useful  on  the  dual  grid 
when  the  dual-cell  centroid  is  far  removed  from  the  corre¬ 
sponding  primary-cell  vertex. 


Here,  H^yEp  denote  the  (face)  normal  component  of  the 
scattered  magnetic  and  electric  fields  as  obtained  from 
contour  integration  (similar  to  traditional  FDTD),  The 
fields  are  assumed  to  remain  constant  over  the  area  inte¬ 
grals.  Because  the  evaluation  faces  are  located  at  the  in¬ 
terface  between  two  cells,  both  electric  and  magnetic  ma¬ 
terials  are  volumetric  averaged  (denoted  by  the  symbol, 
over  all  materials  and  a  superscript,/,  to  distinguish  “face 
average”  (cf.  Sections  4.5, 4.6)). 

The  (scattered)  magnetic  field  projected  along  dual-edge 
s/  is  finally  constructed  by  forming 


rn?,,sf)  (4.6) 

Vi+Vk)  ^  ^ 


4.3  Contour-Integral  Correction 
A  further  improvement,  and  natural  extension,  to  an  edge- 
projection  scheme  is  made  by  correcting  the  normal  com¬ 
ponent  of  the  average  field  along  an  edge  by  using  con¬ 
tour  integration  around  the  face  pierced  by  this  edge. 
Note  that  this  correction  will  ensure  that  the  integral  of 
the  field  divergence  over  each  independent  cell  is  constant 
in  time  and  that  the  FVTD  algorithm  will  reduce  identi¬ 
cally  to  the  traditional  FDTD  form  when  the  face-normal 
is  parallel  to  the  edge  piercing  the  face.  Specifically,  for 

primary-face  (/,  /)  pierced  by  dual-edge  s^,  and  dual-face 
k 

(j,  p)  pierced  by  primary-edge  s^  (cf.  Fig.  4.1): 


The  (scattered)  electric  field  projected  on  primary-edge 
is  similar,  as  are  the  projected  fields  based  on  the  total- 
field  formulation.  Linear,  instead  of  volumetric,  averag¬ 
ing  of  the  vector  fields  has  also  been  used  successfully, 
and  has  been  found  to  provide  improved  accuracy  and 
stability  on  the  dual  grid  when  adjoining  cells  differ  con¬ 
siderably  in  their  volumes.  The  use  of  the  contour  correc¬ 
tion  has  been  found  to  be  most  significant  when  cells  pos¬ 
sess  non-planar  faces,  which  typically  occurs  on  the  dual 
grid.  Although  formulated  and  implemented  differently 
here,  Eq.  (4,6)  leads  to  a  scheme  that  is  identical  to  the 
modified  finite-volume  algorithm  [10]  (which  may  not  be 
obvious).  Note  that  the  FVTD  algorithm  of  Yee  [14]  in¬ 
corporates  a  relation  similar  to  Eq.  (4.6)  only  on  the  pri¬ 
mary-grid  edges  that  connect  to  a  conducting  surface. 
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It  is  noted  that  for  highly  distorted  meshes,  primary  edge, 
s* ,  or  dual  edge,  s/,  may  not  pierce  the  common  face 
shared  by  two  adjoining  cells  and  can  lead  to  computa¬ 
tional  errors.  These  “edge/face  misses”  can  be  minimized 
by  ensuring  that  the  primary  cells  are  reasonably  well 
shaped,  which  can  be  accomplished  with  free-meshed 
tetrahedral  algorithms  by  placing  controls  on  the  cell  as¬ 
pect  ratio,  stretch  and/or  distortion. 


4.4  Time  Integration 

The  finite-volume  method  can  develop  a  late-time  weak 
instability  that  may  manifest  itself  after  several  thousand, 
or  perhaps  only  several  hundred,  time  steps  depending  on 
grid  complexity.  As  previously  noted,  the  rate  of  late¬ 
time  grovnh  is  affected  by  the  manner  in  which  the  spatial 
aspects  of  the  algorithm  are  implemented.  The  discrete 
surface  integration  (DSI)  method  [II]  represents  another 
spatial  variation  that  improves,  but  may  not  eliminate, 
non-physical  growth  on  complex  grids. 

Late-time  growth  can  often  be  completely  controlled  by 
using  dissipative  time  integration  (it  is  noted  that  dissipa¬ 
tive  time  integration  is  used  to  stabilize  the  upwind 
schemes  used  in  CFD-based  formulations).  A  desirable 
dissipative  scheme  behaves  as  an  ideal  low-pass  filter  that 
only  attenuates  frequencies  above  the  resolution  of  the 
grid;  however,  such  a  filter  is  not  practical  to  implement. 
An  approximation  to  the  ideal  filter  is  described  below. 

A  simple  time-averaging  technique  similar  to  that  used  to 
improve  the  stability  of  the  transient  electric- field  integral 
equation  [23]  has  been  found  to  considerably  improve  the 
late-time  stability  of  the  finite-volume  method  on  com¬ 
plex  grids.  The  DSI  technique  will  also  benefit  from  the 
following  scheme,  but  generally  later  in  time. 

The  traditional  second-order  centered  time  difference 
(applied  to  the  scattered  magnetic  field)  is  written  as 


n+3/2  /  xn+l/2 


dt 


/=(«+!)  A  r 


A/ 


(4.7) 


xn+I/2 


A  time  average  for  j  is  the  following 

/  x/j+l/2 

("0  ■ 

I  ( /  Nn+3/2  /  Nn+I/2  /  \n-l/2 

M-O  -(-0  •(■■:) 


(4.8) 


where  a  >  2 .  Time  discretization  of  (4.4a)  with  (4.7)  and 
(4.8)  leads  to  the  advancement  form 


n+3/2 

B 

(h!) 

a  +2. 

B 

(  s 

Nn+1/2  / 

a  +2 

J  < 

dt 
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(4.9) 


where  A  = 


,5  = 


The  vector 


electric  field  is  similar,  as  are  the  advancement  equations 
for  a  total-field  formulation  and  also  Eqs.  (4.5).  For  the 
case  a  =  2,  the  time  average  provides  a  cosine  frequency 
profile.  For  values  greater  than  2  the  profile  increasingly 
flattens,  such  that  as  a  oo  Eq.  (4.9)  returns  to  the  usual 
centered  time-difference  form  and  offers  no  dissipation. 


The  grid  complexity  and  the  level  of  cell  distortion  gen¬ 
erally  dictate  how  small  a  must  be  chosen  to  obtain  long¬ 
term  (tens-of-thousands  of  time  steps)  stability.  Grids  that 
require  values  in  the  range  of  2-3  generally  possess  very 
poorly  shaped  cells  and/or  a  very  wide  range  between  the 
maximum  and  minimum  edge  lengths  (>  40  to  1).  It  may 
not  be  possible  to  long-term  stabilize  these  grids  and  re¬ 
generation  of  the  grid  is  often  required.  Mapped-mesh 
grids  have  been  found  to  require  smaller  values  of  a  than 
grids  generated  using  free-meshed  tetrahedrons.  This  is 
consistent  with  the  previous  comments  that  mapped 
meshes  often  possess  greater  cell  stretch  and  distortion. 
Stabilization  values  in  the  range  of  4-6  are  typical  for 
mapped  meshes.  Tetrahedral  primary  meshes  are  gener¬ 
ally  long-term  stable  using  the  FVHG  default  value  of  10 
(an  exception  is  when  there  are  wide  variations  in  the 
maximum-to-minimum  edge  lengths,  which  is  presently 
under  investigation). 

Figure  4.2  shows  results  for  back  scattering  from  a  finite 
cylinder  with  flat  end  caps.  The  grid  was  generated  using 
mapped-mesh  techniques  and  the  primary  grid  contained  a 
highly  distorted  collection  of  wedges  and  hexahedrons. 
Curve  1  is  the  FVHG  result  using  a  ->  oo  in  the  unstruc¬ 
tured  grid  (unstable  at  0.6  ps),  curve  2  is  based  on  the 
non-dissipative  DSI  algorithm  (unstable  at  2  ps),  while 
curve  3  uses  FVHG  with  a  =  4.  Curve  3  represents 
21,000  time  steps  in  the  unstructured  mesh.  Up  to  the 
DSI  instability,  curves  2  and  3  are  indistinguishable. 
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Fig.  4.2.  Far  back  scattered  field  from  a  PEC  cylinder.  Axial 
polarization. 


4.5  Electric  Materials 

The  average  permittivity  on  dual  face,/?,  is  defmed  by 


(4.10) 


between  primary  cells,  /  and  k.  The  average  permeability, 
,  at  this  face  is  defined  by 

ir/  =  (v!"  n,.  +  vj;  (4.12) 

is  defined  similarly.  Fig.  4.4  shows  the  FVHG  bis¬ 
tatic  RCS  for  a  sphere  of  radius,  a,  with  p  =  4  The 
grid  was  the  same  as  that  used  in  Section  4.5. 


where  the  sums  are  taken  over  the  primary  cells  associated 
with  the  dual  face.  The  average  permittivity  for  dual  cell, 
y,  is  then  constructed  by  forming 


where  Nj  denotes  the  number  of  dual  faces  associated 
with  dual  cell,/.  The  average  electric  conductivities,  O'g^. 

and  are  similarly  defined.  Fig.  4.3  shows  the 

FVHG  bistatic  radar  cross  section  (RCS)  for  a  sphere  of 
radius,  a,  with  e  =  4  Sq.  The  primary  grid  contained 
144^000  free-meshed  tetrahedrons,  and  the  dual  grid  con¬ 
tained  26,000  cells.  The  combined  number  of  edges 
(degrees  of  freedom)  on  both  grids  was  approximately 
500,000.  The  incident  field  was  (p  polarized  and  cr^  is 
shown. 

4.6  Magnetic  Materials 

To  obtain  the  vector  magnetic  field  at  the  primary-cell 
centroids  (cf.  Eqs.  4.3a,  4.4a)  no  material  averaging  is 
required.  For  Eq.  4.5a,  primary  face,  /,  is  assumed  to  fall 


Fig.  4.3.  Bistatic  RCS  for  a  dielectric  sphere,  k,  denotes  frce-space 
wavenumber. 


Fig.  4.4.  Bistatic  RCS  for  a  magnetic  sphere. 


4.7  Surface  Impedance  Boundary  Condition 
To  reduce  total  cell  count  when  thin-material  coatings  are 
present,  surface-impedance  boundary  conditions  are  re¬ 
quired.  A  leading-order  impedance  boundary  condition 
(IBC)  using  a  simple  3x3  matrix  inversion  for  surface 
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cells  was  introduced  in  [12]  based  on  a  total-field  formu¬ 
lation  (which  is  easily  extended  to  the  scattered-field 

form).  The  approximation  simply  replaced  -n  xE^  in  the 
area  integrals  for  the  cell  faces  on  the  object’s  surface  by 

n  X  n  X  H -  ,  where  Z^  denotes  the  local  impedance  and 
n  denotes  an  outward  local  surface  normal.  The  approxi¬ 
mation  was  developed  for  mapped-mesh  hexahedral  cells 
that  form  a  regular,  “stacked”,  cell  pattern  to  the  surface 
of  the  scatterer.  Representative  results  for  a  resistive 
sphere  (with  Z  =  Z^/377)  are  shown  in  Fig.  4.5;  the  grid 
was  generating  using  mapped-meshing  techniques. 


Fig.  4.5,  Monostatic  RCS  for  an  impedance  sphere. 

Although  the  technique  works  extremely  well  for  such 
mapped-meshes,  it  has  subsequently  been  found  to  greatly 
underestimate  the  effect  of  the  coating  when  using  free- 
meshed  tetrahedral  grids.  A  possible  explanation  is  that 
the  formulation  modifies  the  surface  integral  over  the  tan¬ 
gential  surface  face  without  explicitly  modifying  the 
edges  bounding  this  face.  Tetrahedral  grids  lack  the  or¬ 
dered  cell  pattern  of  mapped-meshes,  and  elements  may 
possess  surface-tangential  edges  without  possessing  a 
tangential  surface  face.  Thus,  an  edge-based  generaliza¬ 
tion  appears  to  be  necessary  and  is  presently  under  inves¬ 
tigation.  The  alternative  finite-volume  IBC  formulations 
described  in  [24]  are  similarly  not  expected  to  be  directly 
applicable  to  ffee-meshed  tetrahedral  grids. 

4.8  Miscellaneous  Results 

Geometry,  measurements,  and  method-of-moment  (MoM) 
calculations  for  the  monostatic  RCS  of  the  9.936-mch 


NASA  almond  were  presented  in  [25].  The  FVHG 
analysis  of  this  object  is  presented  here.  Dimensions,  ori¬ 
entation,  and  illumination  are  identical  to  those  described 
in  [25];  the  reader  is  referred  to  this  reference  for  details. 
The  upper  half  of  the  almond  embedded  in  a  rectangular 
box  is  shown  in  Figs.  4.6a,b.  Note  the  closeness  of  the 
box  to  the  almond’s  surface.  A  tetrahedral  grid  of  the  iso¬ 
lated  almond  is  shown  in  Fig.  4.7.  The  total  tetrahedral 
grid  included  approximately  65,000  elements,  of  which 
15,000  filled  the  almond’s  interior.  Based  on  a  maximum 
(over  both  grids)  average  edge  length,  the  grid  provides 
13  “cells”  per  wavelength  at  3.5  GHz.  The  monostatic 
RCS  at  1.19  GHz  for  the  vertical/vertical  (VV)  and  hori¬ 
zontal/horizontal  (HH)  cases  [25]  are  shown  in  Fig.  4.8. 
Measured  data  are  only  provided  at  5-degree  intervals. 
Although  the  HH  FVHG  results  differ  from  the  measure¬ 
ments  by  2-4  dBSM  (dB  square  meters)  for  certain  angles, 
the  results  are  globally  very  close  to  the  published  MoM 
calculations  for  this  geometry  (included  in  Fig.  4.8). 


Fig.  4.6a,  NASA  almond  in  box  (top  half). 


Fig.  4.6b.  Top  half  of  embedded  almond  (end  view). 


Fig.  4.7.  Tetrahedron  grid  of  isolated  almond. 
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Fig.  4.8.  Monostatic  RCS  for  9.936-inch  NASA  metallic  almond, 
5.0  Conclusions 

The  FVHG  algorithm  provides  a  seamless  connection 
between  unstructured  and  structured  grids  that  is  free  from 
spatial  interpolation  across  grid  boundaries.  By  treating  a 
non-orthogonal  region  as  a  local  problem  within  a  struc¬ 
tured  computational  volume,  computer  resources  are 
greatly  reduced  while  adding  considerable  modeling 
flexibility.  Multi-material  regions  are  easily  accommo¬ 
dated  within  the  unstructured  grid.  The  FVHG  algorithm 
was  designed  as  a  general-purpose  solver  and  is  presently 
being  used  for  applications  ranging  from  microelectronics 
to  radar  cross  section. 

The  FVHG  technique  accommodates  cells  of  arbitrary 
shape.  Indeed,  although  the  primary  grid  may  contain 
only  tetrahedral  and/or  hexahedral  elements,  the  dual  grid 
may  have  elements  with  forty  or  more  faces,  each  with 
numerous  edges.  The  FVTD  algorithm  used  with  FVHG 
incorporates  an  FDTD  correction  along  with  late-time- 
stabilizing  time  integration;  this  approach  has  been  found 
to  be  accurate  and  generally  stable  even  for  very  complex 
grids.  Simpler  FVTD  algorithms  can  be  implemented,  but 
their  accuracy  and  late-time  stability  have  been  found  to 
suffer  when  the  grid  becomes  increasingly  complex. 
Formulations  that  succeed  with  one  cell  type  applied  to 
certain  geometries  may  experience  difficulties  with  other 
cell  types  and/or  other  geometries.  The  authors  know  of 
no  FVTD  algorithm  that  is  guaranteed  to  provide  long¬ 
term  stability  for  arbitrary  grids. 

The  free-meshing,  tetrahedral-element  application  of  the 
FVHG  algorithm  presented  here  provides  a  powerful 
transient  analysis  tool.  Although  the  unstructured  region 
requires  considerably  more  computer  resources  than  the 
FDTD  grid,  the  rapid  transition  from  the  unstructured 


mesh  to  structured  FDTD  permits  efficient  use  of  com¬ 
puter  resources.  The  transition  can  be  accomplished 
within  only  1-2  FDTD  cells  from  the  modeled  geometry. 
This  rapid  transition  has  been  realized  with  the  solid- 
modeling/fmite-element  software,  I-DEAS.  However,  to 
permit  the  efficient  generation  of  very  large  grids  (>l-2 
million  cells),  parallelization  of  the  grid-generation  algo¬ 
rithms  will  be  necessary. 

The  present  implementation  of  the  FVHG  algorithm  en¬ 
closes  the  entire  modeled  geometry  in  a  single  rectangular 
box.  To  further  increase  efficiency,  various  sized,  inter¬ 
connected  rectangular  boxes  may  be  desirable.  These  are 
easily  included  at  the  solid-modeling/primary-grid- 
generation  level.  The  only  added  complexity  is  identifi¬ 
cation  of  the  location  where  the  unstructured/structured 
interfaces  exist.  This  straightforward  extension  is  pres¬ 
ently  under  investigation. 
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Abstract 

One  method  for  reducing  the  radar-cross  section  of  objects  such  as  aircraft  and  missiles  is  the 
application  of  a  lossy  coating.  Computing  scattering  from  targets  coated  with  dielectric/magnetic 
materials  is  challenging  for  many  computational  methods  due  to  the  reduced  wavelengths  of 
an  incident  field  inside  the  coating.  These  smaller  wavelengths  require  increased  sampling  of 
the  fields  and  thus  more  memory  and  computation  time.  A  technique  for  implementing  this 
calculation  without  greatly  increased  memory  requirements  or  computation  times  has  recently 
been  developed  using  a  finite-difference,  time-domain  (FDTD)  code  and  tested  in  one-,  two-  and 
three- dimensions.  The  method  requires  knowledge  of  the  frequency  behavior  of  the  complex 
permittivity  and  permeability  and  the  thickness  of  the  dielectric  coating.  The  surface  impedance 
at  the  surface  of  the  coating  is  computed  based  on  the  given  information  and  then  approximated 
using  a  summation  of  causal  functions.  The  approximated  impedance  is  Z-Transformed  and  added 
to  the  FDTD  code  in  special  update  equations  for  the  fields  at  the  surface  of  the  coating.  No 
computations  are  required  inside  the  coatings,  so  the  FDTD  grid  can  be  sized  based  on  the  free 
space  wavelength.  The  result  obtained  is  valid  over  the  entire  frequency  range  of  interest  assuming 
that  the  approximated  surface  impedance  is  a  good  match  over  the  entire  range.  Comparisons 
with  analytical  solutions  in  one-dimension  and  measurements  in  three- dimensions  show  very  good 
agreement. 

Introduction 

The  idea  of  using  a  surface  impedance  which  describes  the  interaction  of  the  tangential  fields 
on  the  surface  of  an  object  as  a  replacement  for  computing  the  actual  fields  inside  an  object  was 
first  presented  by  Leontovich[l]  in  1948.  The  advantage  of  such  a  method  is  that  when  materials 
with  high  values  of  permittivity,  permeability,  and/or  conductivity  are  encountered,  the  fields 
inside  the  material  need  not  be  calculated.  If  the  calculations  within  the  lossy  dielectric/magnetic 
material  are  replaced  by  a  relationship  at  the  surface  of  the  material,  a  great  savings  in  computer 
memory  and  computation  time  can  be  obtained.  Leontovich’s  idea  was  not  greatly  pursued  (with 
the  exception  of  a  paper  by  Senior [2]  in  1960)  until  computer  methods  became  more  common  in 
the  early  1990’s.  Several  relevant  papers  have  since  been  published  by  Tesche[3],  Riley  and  Turner 
[4],  Maloney  and  Smith[5]  and  Beggs  et  al.[6].  The  work  by  Maloney  and  Smith  [5]  and  Beggs  et 
al  [6]  are  most  related  to  the  current  topic  of  scattering  from  coated  targets. 

Previous  authors  ([5],  [6])  have  proposed  methods  of  applying  the  surface  impedance  concept 
to  FDTD.  The  difference  between  the  work  of  these  authors  and  this  paper  is  that  they  made 
the  assumption  that  any  fields  transmitted  into  the  dielectric  would  be  dissipated  within  the 
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material  and  no  energy  would  be  reflected  at  the  terminating  surface.  This  assumption  limits 
the  application  of  their  approaches,  but  it  also  allows  the  surface  impedance  to  be  described^  in 
a  relatively  simple  expression.  In  the  case  of  a  thin  coating  over  conductor,  their  assumption 
of  no  reflection  from  the  terminating  surface  can  not  be  applied  because  the  layer  is  typically 
thin  enough  for  significant  energy  to  reach  the  conducting  surface  and  be  reflected.  Indeed,  thin 
coatings  that  are  designed  to  absorb  most  of  the  incident  energy  include  the  effects  of  multiple 
reflections  from  the  conductor  surface. 

Implementation 

The  surface  impedance  vs.  frequency  for  a  thin  lossy  layer  over  a  perfect  conductor  is  relatively 
complicated  and  therefore  it  is  very  difficult  to  use  the  methods  suggested  by  [5]  and  [6].  As  an 
alternative  to  these  methods,  a  new  approach  was  used  in  which  the  impedance  expression  is 
replaced  by  an  approximation.  The  approximation  is  made  using  a  least- squares  curve  fitting 
algorithm.  Basis  functions  are  given  with  variable  parameters,  and  these  parameters  are  adjusted 
for  the  least  amount  of  difference  between  the  actual  data  (the  impedance)  and  the  approximation. 
The  basis  functions  chosen  must  be  causal  since  these  functions  will  be  transformed  from  the 
frequency  domain  into  the  time  domain  and  non-causal  functions  could  cause  the  FDTD  code  to 
become  unstable. 

The  material  parameters  for  a  commercially  available  radar  absorbing  material  (RAM)  are 
shown  in  Figure  1.  This  is  GDS  material  available  from  Emerson  and  Cuming.  The  relative  per¬ 
mittivity  over  the  frequency  range  of  interest  (1  to  18  GHz)  varies  from  17  to  12  while  the  relative 
permeability  is  as  high  as  4.5  at  some  frequencies.  Both  the  permittivity  and  the  permeability 
have  loss  terms  (imaginary  parts)  associated  with  them.  The  surface  impedance  of  this  material 
is  computed  assuming  that  the  coating  is  applied  to  an  infinite  conducting  sheet,  so  transmission 
line  methods  are  used.  The  surface  impedances  and  the  resulting  approximations  obtained  for  a 
0.09”  layer  of  this  material  is  shown  in  Figure  2.  Getting  a  nearly  exact  fit  between  the  actual 
impedance  and  the  approximation  is  difficult  and  may  require  a  large  number  of  basis  functions 
in  the  curve  fit.  This  results  in  added  complexity  and  the  possibility  that  the  calculations  will  go 
unstable.  It  was  found  that  a  reasonably  close  fit,  as  shown  in  Figure  2,  provided  stability  and 
very  good  agreement  using  only  two  basis  functions. 

To  implement  the  curve-fit  approximation  of  the  surface  impedance  into  the  FDTD  algorithm 
requires  the  application  of  a  transformation  since  the  impedance  expression  is  in  the  frequency 
domain  and  the  calculations  must  be  done  in  the  time  domain.  Since  the  FDTD  code  is  very 
similar  to  a  digital  system  due  to  the  discrete  sampling  times  at  which  the  fields  are  calculated,  it 
was  decided  that  the  Z- Transform  would  be  used.  Previous  work  by  Sullivan  [8]  has  shown  that 
the  Z- Transform  is  viable  alternative  to  the  Fourier  transform  for  FDTD  applications. 

Once  the  approximated  impedance  has  been  transformed,  it  must  be  added  to  the  FDTD  field 
update  equations.  The  electric  field  tangential  to  the  surface  of  the  scatterer  is  defined  by  the 
expression  [3] 

{ZsJs)  (1) 

Js  =  hxH,  (2) 

■where  =  the  surface  total  electric  field,  Jg  =  the  surface  current,  Hg  =  the  surface  magnetic 
field,  and  Zg  =  the  surface  impedance.  From  this  relationship,  an  expression  for  the  tangential 
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electric  field  just  inside  the  surface  of  the  coating  can  be  found  and  substituted  into  the  one- 
dimensional  FDTD  magnetic  field  update  equation 

E^ik  -f  1)  -  E^{k)  Hy'^^k  +  D-nr^jk  +  l) 

- a; - At  ^  ^ 

as  a  replacement  for  the  E^{k  +  1)  term.  Here  Az  is  the  spatial  increment  and  At  is  the  time 
step  size.  In  two  and  three  dimensions,  a  similar  substitution  would  be  required  for  all  the  electric 
fields  tangential  to  the  surface  with  special  attention  paid  to  the  vector  nature  of  Equation  1  to 
obtain  the  proper  sign.  Fields  on  the  surface  of  the  scatterer  will  use  this  special  equation  while 
all  exterior  fields  will  use  the  usual  FDTD  equations.  Any  fields  interior  to  the  surface  do  not 
affect  the  calculation  results  are  updated  as  if  they  were  perfect  conductor. 

An  FDTD  code,  modified  as  described  above,  was  written  for  one,  two,  and  three  dimensional 
problems.  The  one  dimensional  code  was  tested  against  analytically  computed  reflection  coeffi¬ 
cients  with  excellent  results.  Analytical  solutions  in  two  dimensions  are  difficult  to  compute,  so 
the  surface  impedance  code  was  validated  against  an  unmodified  FDTD  code  which  used  very 
small  cells  and  was  re-run  for  every  frequency.  Finally,  the  three  dimensional  results  were  com¬ 
pared  to  measurements  of  8  inch  square  conducting  plates  and  a  generic  model  missile  (Figure 
6).  The  measurements  of  both  targets  were  performed  a  several  incidence  angles  with  the  most 
interest  being  in  the  RCS  of  the  missile  at  angles  near  the  nose. 

The  missile  (Figure  3)  was  constructed  from  a  sealed  copper  tube  and  coated  with  a  0.025” 
layer  of  GDS  RAM  with  material  parameters  shown  in  Figure  1.  The  ends  of  the  cylinder  are  not 
coated.  It  has  fins  made  from  a  high  permittivity  (relative  =  10.4)  dielectric  that  are  attached 
to  the  copper  cylinder  through  slits  cut  into  the  RAM  coating.  There  is  also  a  dielectric  (relative 
permittivity  =  2.3)  nose  cone  attached  directly  to  the  end  of  the  copper  cylinder.  An  identical 
missile  model  but  without  the  GDS  RAM  coating  was  also  constructed  and  measured  to  gauge 
the  effect  of  adding  the  coating. 

The  FDTD  approximation  of  the  missile  of  Figure  3  used  cubical  FDTD  cells  2  mm  on  a  side. 
The  dimensions  of  the  FDTD  problem  space  were  84  x  84  x  347  cells.  Due  to  the  thinness  of 
the  fins,  the  permittivity  was  reduced  using  a  sheet  impedance  approximation.  This  resulted  in  a 
relative  permittivity  of  4.5  instead  of  10.4.  A  minimum  border  of  20  cells  surrounded  the  missile 
on  all  side  which  terminated  in  a  second  order  stabilized  Liao  boundary  condition  [9], [10].  A 
modulated  Gaussian  pulse  plane  wave  which  had  frequency  content  over  the  range  of  1  to  14  GHz 
was  used  as  the  excitation.  Plane  wave  incident  angles  from  0  to  90  degrees  measured  from  the 
nose,  with  the  missile  cylinder  rotated  horizontally,  were  computed  using  2000  time  steps.  The 
time  step  size  was  3.85  ps. 

As  was  mentioned  previously,  FDTD  calculations  of  an  uncoated  missile  were  performed  and 
measurements  were  made  so  that  the  effects  of  the  RAM  coating  could  be  seen.  Figure  4  shows 
a  comparison  of  the  measurements  and  FDTD  calculations  in  a  plot  of  the  RCS  of  the  un coated 
missile  at  4  degrees  off  the  nose  (Vertical  Polarization).  The  coated  missile  has  a  very  different 
response,  especially  at  the  higher  frequencies,  as  can  be  seen  in  Figure  5.  The  frequency  dependent 
surface  impedance  FDTD  code  (FDTDRAMSI)  gives  good  results  for  the  coated  missile  over  the 
entire  range.  Additional  comparisons  at  12.6  degrees  shown  in  Figures  6-7  show  the  significant 
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effects  of  the  coating,  and  the  accuracy  of  the  FDTDRAMSI  code  in  predicting  these  effects.  The 
FDTDRAMSI  results  for  other  angles  and  horizontal  polarization  are  of  similar  quality. 

Conclusion 

An  FDTD  surface  impedance  approximation  can  be  applied  to  scattering  from  conducting 
targets  coated  with  a  thin  lossy  dielectric/magnetic  layer.  The  formulation  described  in  this 
paper  gives  broad  frequency  band  results  in  a  single  FDTD  calculation  with  only  a  minor  increase 
in  memory  and  execution  time  over  usual  FDTD.  In  comparison  to  normal  FDTD  calculations 
for  coated  targets,  the  savings  in  memory  and  execution  time  are  very  significant.  This  is  in 
large  part  due  to  the  ability  to  size  the  grid  based  on  the  free  space  wavelength  instead  of  the 
dielectric/magnetic  material  wavelength  since  no  computations  actually  take  place  inside  the 
coating.  Three  dimensional  results  for  a  complex,  realistic  target  have  been  shown  with  good 
agreement  to  measurements. 
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Figure  1:  Complex  permittivity  and  permeability  of  the  CDS  RAM  material  over  the  frequency 
range  of  interest. 
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Figure  2:  The  computed  and  approximated  surface  impedances  of  a  0.09”  layer  of  GDS  RAM 
material  on  an  infinite  PEC  plane. 
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Figure  3:  Diagram  of  the  generic  missile  used.  The  coated  version  has  a  0.025”  layer  of  GDS 
RAM  material  on  the  tube. 
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Figure  4:  RCS  of  the  uncoated  missile  of  Figure  3  at  an  incident  angle  4  degrees  off  the  nose.  The 
plot  is  for  a  vertically  polarized  incident  plane  wave  and  shows  the  comparison  of  the  measurements 
and  a  regular  FDTD  code. 
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Figure  5:  RCS  of  the  coated  missile  of  Figure  3  at  an  incident  angle  4  degrees  off  the  nose.  The  plot 
is  for  a  vertically  polarized  incident  plane  wave  and  shows  the  comparison  of  the  measurements 
and  modified  FDTD  RAM  SI  code  which  uses  the  approximated  input  impedance  to  give  broad 
band  results. 
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Figure  6:  RCS  of  the  uncoated  missile  of  Figure  3  at  an  incident  angle  12.6  degrees  off  the 
nose.  The  plot  is  for  a  vertically  polarized  incident  plane  wave  and  shows  the  comparison  of  the 
measurements  and  a  regular  FDTD  code. 


Figure  7:  RCS  of  the  coated  missile  of  Figure  3  at  an  incident  angle  12.6  degrees  off  the  nose.  The 
plot  is  for  a  vertically  polarized  incident  plane  wave  and  shows  the  comparison  of  the  measurements 
and  modified  FDTDRAMSI  code  which  uses  the  approximated  input  impedance  to  give  broad 
band  results. 
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Abstract 

A  hybrid  method  which  combines  the  FDTD  and  the  FVTD  methods  is  presented.  The  FVTD  based  on  a 
unstructured/stnictured  and  conformal  grid  is  used  in  the  near  vicinity  of  the  surface  of  a  scatterer  and  the  FDTD  is 
used  to  model  the  fields  in  the  surrounding  area.  The  two  are  coupled  together  through  interpolation.  The  vertex 
based  FVTD  allows  for  more  convenient  and  accurate  interpolations  than  a  conformal  FDTD  method.  The  hybrid 
method  is  validated  through  two  examples  --  the  RCS  of  a  PEC  sphere  and  the  RCS  of  an  ellipsoid  of  revolution  - 
by  comparison  with  the  exact  Mie  series  solution  and  with  a  MOM  code  calculation. 

1.  Introduction 

This  paper  will  gives  details  on  our  work  in  the  conformal  time  domain  technique  during  the  past  several  years.  It 
is  a  re-written  version  of  a  number  of  papers  we  have  submitted  to  the  IEEE  Transaction  on  Antennas  and 
Propagations  [1-6].  Therefore  we  shall  not  give  a  historical  account  of  the  method  nor  shall  we  discuss  the  relation 
of  our  work  with  other  published  work  in  conformal  FDTD-like  time  domain  methods  [7-20].  Such  accounts  have 
been  given  in  the  quoted  papers.  Interested  readers  are  urged  to  consult  the  references  in  [1-6], 

2.  The  FDTD  and  the  FVTD 

There  are  two  integral  forms  of  Maxwell’s  equations  that  are  useful  in  time  domain  numerical  solutions  of 
electromagnetic  problems: 


-f  S*da=  f  E*dJ 

JA  JdA 

(la) 

f  f  H^dJ 

(lb) 

Ja  JdA 

—  I  Bdv=  f  h'x.Eda 

(2a) 

Jv  Jdv 

f  f  nxNda 

Jv  J^V 

(2b) 

The  discretizations  based  on  equations  (la)  and  (lb)  have  been  commonly  known  and  will  be  referred  to  as  Finite 
Difference  Time  Domain  (FDTD,  generalized),  and  the  discretizations  based  on  equations  (2a)  and  (2b)  have  been 
known  and  will  be  referred  to  as  Finite  Volume  Time  Domain  (FVTD).  The  equations  (la)  and  (lb)  are  the 
mathematical  description  of  the  Faraday’s  law  and  Ampere’s  law,  whereas  the  equations  (2a)  and  (2b)  can  be 
obtained  through  integrating  Maxwell's  partial  differential  equations  over  a  volume.  Under  smoothness 
assumptions  they  are  equivalent.  Our  new  proposed  conformal  method  will  be  a  combination  of  FDTD  and  FVTD. 

3.  The  Leapfrog  FDTD  and  FVTD 

The  numerical  approach  we  shall  present  is  very  simple.  It  involves  no  more  than  accurate  evaluation  of  line 
integrals,  surface  integrals,  and  volume  integrals.  We  shall  use  time  leapfrog  in  updating  the  electric  field  and  the 
magnetic  field.  In  order  to  be  not  too  wordy  in  our  discussions,  we  use  the  FDTD  notations  and  it  is  advantageous 
to  introduce  the  following  terminology: 

The  line  integral  on  the  right  hand  side  of  (la)  is  the  electric  circulation  around  the  curve  3 A;  and  the  curve  3A 
is  an  electric  contour. 

The  line  integral  on  the  right  hand  side  of  (lb)  is  the  magnetic  circulation  around  the  curve  3 A;  and  the  curve 
dA  is  a  magnetic  contour. 

The  surface  integral  on  the  right  hand  side  of  (2a)  is  the  electric  vorticity  over  the  surface  3V. 

The  surface  integral  on  the  right  hand  side  of  (2b)  is  the  magnetic  vorticity  over  the  surface  3V. 

The  leapfrog  time  discretized  equations  corresponding  to  (la)-(2b)  are: 
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(3a); 

(3b); 

(4a); 


(4b). 

The  above  identities  are  subject  to  the  right-handed  orientation  of  vector  analysis.  Many  time  domain  algorithms 
can  be  obtained  by  Judicious  choices  of  the  curves  and  surfaces.  In  fact  our  updating  schemes  are  almost  self 
evident  from  equations  (la)-(2b).  In  equations  (la)-(lb)  we  notice  that  A  =J  n  da  is  the  vector  area  enclosed  by 
the  curve  3A.  From  vector  analysis  it  is  known  that  this  vector  area  is  independent  of  the  area  A.  It  is  specified 
uniquely  by  the  curve  3A.  It  is  called  the  area  vector  A  enclosed  by  3A.  Therefore  another  convenient 
approximation  of  (3a)-(3b)  is: 


_  ^«-l/2  )  .  ^  ^  ^  •dJ 

Ar  j 

dA 


A-  j •dT 

dA 


(5a) 


(5b) 

Now  the  FDTD  and  FVTD  algorithm  can  be  loosely  stated: 

For  FDTD 

#  Use  the  electric  circulation  around  a  curve  to  update  the  component  of  the  magnetic  field  along  the  area  vector 
of  the  surface  enclosed  by  the  curve. 

#  Use  the  magnetic  circulation  around  a  curve  to  update  the  component  of  the  electric  field  along  the  area  vector 
of  the  surface  enclosed  by  the  curve. 

For  FVTD 

#  Use  the  electric  vorticity  over  a  surface  to  update  the  magnetic  vector  in  the  center  of  the  volume  enclosed  by 
the  surface. 

#  Use  the  magnetic  vorticity  over  a  surface  to  update  the  electric  vector  in  the  center  of  the  volume  enclosed  by 
the  surface. 

The  accuracy  of  the  algorithm  depends  on  how  carefully  we  choose  the  curves  enclosing  the  surface  and  the 
surfaces  enclosing  the  volume.  For  example  the  electric  or  magnetiic  field  variables  in  (4a)-(4b)  should  be 
evaluated  as  close  to  the  cenuoid  of  the  area  as  possible.  To  be  explicit  let  us  consider  the  curve  (Figure  1)  dA  = 

(ij^)  _>(i+i  j^)  _>(i+i  j+l,k)  — >(i,j+l,k)  — >(i  j,k).  The  area  vector  enclosed  by  the  curved  is:  ^  ~  z  AxAy  ^ 
The  centroid  of  the  area  is  (i+1/2,  j+1/2,  k)  and  the  electric  circulation  is  approximately: 

•4-1/2, y.*)Ax  +  £y'‘(i  +  l,;+l/2,A:)Aj-£x'‘(/  +  l/2,i  +  l,*)Ax-E>"(i,y  +  l/2,fc)Ay. 
dA 

Substituting  these  approximations  in  (4a)  one  obtains  the  following  equation: 
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Equation  (6)  is  one  of  the  FDTD  equations  to  update  the  component  of  the  magnetic  field.  But  the  equations  (3a)- 
(3b)  can  ht  applied  to  arbitrary  curve  even  when  the  curve  is  not  planar!  This  is  the  power  of  the  area-curve 
integral  form  of  the  Maxwell's  equations.  If  we  include  sources  on  the  right  hand  side  of  (la) -(lb),  the  integral 
form  can  be  used  to  model  circuit  elements  as  well.  To  facilitate  the  discussion  of  the  FVTD  algorithm  we  use  the 
rectangular  grid  topology.  We  define  (i,  j,  k)  =  (xj,  yj,  zk).  The  grid  points  need  not  be  equally  spaced.  This  grid 
will  be  called  an  electric  grid  ( see  Figures  2  and  3  for  electric  elements).  These  vertices  are  the  electric  vertices. 
Located  at  the  electric  vertices  is  the  electric  vector  E  (we  store  the  three  rectangular  components  of  the  electric 
field  there).  A  computational  element  is  a  distorted  cube,  called  an  electric  cube  or  an  electric  element.  TTie 
faces  of  an  electric  element  are  the  electric  faces.  In  rectangular  grid  an  electric  cube  is  an  FDTD  cube.  The 
"center"  of  an  electric  cube  is  denoted  by  (i+1/2,  j+1/2,  k+1/2).  This  will  be  called  a  magnetic  vertex.  Two 
magnetic  vertices  are  connected  if  the  two  corresponding  electric  elements  share  a  common  face.  We  thus  generate 
a  collection  of  magnetic  elements;  and  the  collection  constitutes  the  magnetic  grid.  Located  at  the  magnetic 
vertices  are  the  magnetic  vector  B  (or  H).  The  updating  of  E  and  H  (  we  assume  that  B=|aH  and  D-eE  for 
simplicity)  are  carried  out  with  the  help  of  equations  (4a)  and  (4b).  The  volume  in  equation  (4a)  is  an  electric 
element  and  the  volume  in  equation  (4b)  is  a  magnetic  element.  If  the  faces  of  an  electric  element  are  not  coplanar, 
the  volume  of  the  electric  element  is  not  defined.  So  far  in  our  discussion  an  electric  face  has  four  vertices  which 
may  not  be  coplanar.  We  make  the  electric  face  definite  by  adding  the  barycenter  of  the  four  vertices.  The 
rectangular  coordinates  of  the  barycenter  is  the  average  of  the  rectangular  coordinates  of  the  four  vertices.  Now  the 
specific  electric  face  we  use  is  the  collection  of  triangles,  each  of  which  is  formed  with  two  vertices  and  the 
barycenter  (Figure  4).  With  the  electric  faces  made  definite  we  have  now  a  definite  volume  for  the  electric 
element.  In  the  evaluation  for  the  electric  vorticity  we  have  to  make  sure  that  the  unit  surface  normal  is  pointing 
away  from  the  volume.  Therefore  the  collection  of  oriented  triangles  (  use  the  right  hand  rule:  as  the  fingers  curl 
along  the  perimeter  of  a  triangle,  the  normal  of  the  triangle  is  in  the  direction  of  the  thumb)  must  be  such  that  the 
area  vector  points  away  from  the  volume.  Thus  if  (i+1/2,  j+1/2,  k+1/2)  denotes  the  "center"  of  an  electric  element, 
the  electric  face  consisting  of  the  vertices  (i,  j,  k),  (i+1,  j,  k),  (i+1,  j+1,  k)  and  (i,  j+1,  k)  will  have  the  following 
four  oriented  triangles: 

{(i+l.j,k),(i,j,k),b),{(i+l,j+l,k),(i+l,j,k),b], 

[(i,j+l,k),  (i+l,j+l,k),b),  and  {(i,  j,k),  (i,  j+1,  k),  b} 

where  b  is  the  barycenter  which  can  be  denoted  by  (i+1/2,  j+ 1/2,  k). 

To  evaluate  the  partial  contribution  to  the  electric  vorticity  from  this  face,  we  first  define  the  value  of  the  electric 
vector  at  the  barycenter  to  be  the  average  of  its  values  at  the  four  vertices.  The  surface  integral  over  the  face  is  now 
the  sum  of  the  integral  over  the  triangles.  Over  each  triangle  we  approximate  the  integral  by  taking  the  cross 
product  of  the  approximate  value  of  the  electric  field  at  the  centroid  of  the  triangle  and  the  area  vector  of  the 
triangle. 

4.  The  Overlapping-Grid  FDTD  /  FVTD  hybrid 

Our  FDTD/FVTD  hybrid  uses  a  composite  grid  consisting  of  a  rectangular  grid  intersecting  a  body  conformal 
grid.  The  conformal  grid  consist  of  an  electric  grid  and  a  magnetic  grid.  The  electric  grid  is  a  few  layer  of  prisms 
erected  along  the  normal  of  the  scattering  surface.  Our  main  goal  is  to  make  use  of  the  triangular  surface  grid 
usually  generated  for  the  MOM  frequency  domain  integral  equation  methods.  Therefore  our  conformal  electric 
grid  will  consist  of  layers  of  triangular  prisms  (Figure  3).  No  more  than  five  layers  are  needed.  The  conformal 
electric  grid  intersects  an  outer  rectangular  grid  in  which  the  FDTD  algorithm  is  used.  Constructed  from  the 
conformal  electric  grid  is  the  magnetic  grid.  The  height  of  the  prisms  is  approximately  the  same  as  Ax,  the  spatial 
length  of  the  FDTD  grid.  The  lengths  of  the  sides  of  the  surface  triangles  are  comparable  to  Ax.  In  the  interior 
electric  vertices  or  the  magnetic  vertices  (points  which  are  neither  grid  boundary  points  nor  physical  boundary 
points)  the  FVTD  algorithm  is  used  for  updating.  The  interior  boundary  of  our  rectangular  grid  is  0.5Ax  (0.5  zone) 
to  Ax  (1  zone)  away  from  the  scatterer  (Figure  5).  The  outer  boundary  electric  field  on  the  FVTD  primary  grid  is 
obtained  through  interpolation  of  the  calculated  rectangular  FDTD  electric  field;  the  rectangular  FDTD  electric 
field  at  the  inner  boundary  of  the  rectangular  is  obtained  through  interpolating  the  calculated  FVTD  field  (Figures 
6a,b).  For  the  electric  vertices  at  the  boundary,  we  make  use  of  the  physical  boundary  condition  and  interpolation. 
This  procedure  will  be  explained  more  fully  in  the  section  on  boundary  conditions. 

5  The  Double  Interpolation 

At  each  time  step  we  cany  out  a  double  interpolation  for  the  data  from  one  grid  to  another  grid. 
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The  interpolation  from  the  rectangular  FDTD  to  obtain  the  electric  vector  at  the  outer  boundary  of  the  conformal 
body  grid  is  discussed  first  From  the  rectangular  coordinate  of  an  outer  boundary  point  we  can  find  out  to  which 
FDTD  cube  it  belongs  and  where  in  this  FDTD  cube  it  is  located.  The  electric  vector  at  the  vertices  of  this  FDTD 
cube  is  obtained  first  through  interpolation  from  the  FDTD  edge  components.  The  values  of  the  electric  vector  at 
the  vertices  of  the  electric  cube  is  now  used  to  interpolate  the  electric  vector  at  the  outer  boundary  vertex  through 
bilinear  interpolation. 

To  interpolate  for  the  rectangular  component  of  the  electric  field  at  the  midpoint  of  an  edge  in  the  interior 
boundary  of  the  rectangular  grid,  we  find  out  which  triangular  prism  this  midpoint  belongs  to  and  where  in  this 
prism  this  point  is  located.  It  is  a  little  tricky  for  this  process.  We  therefore  explain  in  a  little  more  detail.  Let  a 
a  point  where  a  rectangular  component  of  the  electric  field  is  to  be  interpolated.  We  first  enumerate  those  electric 
vertices  which  is  closest  to  the  point  a.  We  then  enumerate  the  triangular  prisms  sharing  the  point  a.  One  of  these 
prisms  will  contain  the  point  a.  The  boundary  of  the  prisms  are  faceted  (the  upper  and  lower  faces  are  triangles, 
and  each  of  the  lateral  faces  is  the  union  of  four  triangles  each  of  which  has  the  barycenter  as  a  vertex).  Each  of 
these  prisms  has  a  definite  volume.  For  each  of  the  triangles  from  the  collection  of  triangles  forming  the  boundary 
of  one  of  these  prisms,  construct  a  pyramid  with  the  addition  of  point  a.  The  sum  of  the  volumes  of  these  pyramids 
will  be  larger  than  the  volume  of  the  prism  unless  the  point  a  is  in  the  closure  of  this  prism.  In  that  case  the  sum  of 
the  volumes  of  the  pyramids  equals  the  volume  of  the  prism.  The  point  a  may  lie  at  the  boundary  of  more  than  one 
prisms  but  we  need  only  to  find  one  such  prism.  Having  found  one  prism  the  point  a  belongs  to,  we  subdivide  the 
"vertical"  edges  of  the  prism  into  a  large  number  of  points  with  equal  but  small  distance  between  two  consecutive 
points  (Figure  7).  We  thus  establish  equal  discrete  "venical"  coordinates  along  the  vertical  edges.  The  three  points 
on  the  three  vertical  edges  with  the  same  vertical  coordinate  forms  a  triangle  (Figure  7).  We  form  a  pyramid  with 
each  triangle  and  the  point  a.  We  compute  the  volume  of  each  of  these  pyramids  and  find  one  with  the  minimum 
volume!  Call  the  base  triangle  a  minimum  triangle.  The  "vertical"  coordinate  of  the  point  a  is  approximately  the 
"vertical"  coordinate  of  the  minimum  triangle.  In  other  words,  the  point  a  is  approximately  contained  in  this 
minimum  triangle.  Now  we  can  find  the  area  coordinates  of  the  point  a  in  this  minimum  triangle.  The  value  of  the 
electric  vector  at  the  vertices  of  the  minimum  triangle  is  obtained  through  linear  interpolation  from  the  values  at  the 
vertices  of  the  prism.  Making  use  of  the  area  coordinates  of  a  we  can  now  use  linear  interpolation  to  get  an 
approximate  value  for  the  electric  vector  E  at  a.  Finally  the  rectangular  component  of  the  electric  field  at  the  point 
a  is  obtained. 

6.  Locally-Distorted-Grid  FVTD  and  FDTD 

The  FVTD  algorithm,  when  applied  to  a  primary  grid  consisting  of  cubes,  is  almost  identical  to  the  rectangular 
FDTD.  If  the  outer  boundary  vertices  of  the  FVTD  are  the  same  as  the  inner  vertices  of  the  FDTD,  it  is  quite 
simple  to  interpolate  the  FVTD  field  from  the  FDTD  field  and  vice  versa  on  the  interface  between  these  two 
algorithms.  Thus  when  a  locally  distorted  electric  grid  is  available,  one  could  use  the  FVTD  on  the  distorted  part 
of  the  grid  and  the  FDTD  on  the  undisiorted  part  of  the  grid.  One  can  also  use  FVTD  throughout.  The  only  reason 
one  uses  FDTD  at  all  in  this  case  is  because  of  the  bookkeeping  simplicity  of  FDTD. 

7.  Boundary  Condition  Simulations  in  the  FDTD/FVTD  Hybrid 

The  discretized  equations  for  FDTD  listed  in  section  3  can  only  be  used  for  "interior"  field  points  where  the 
electric  or  magnetic  circulations  can  be  obtained  from  the  variables  in  the  computational  volume.  Also  the 
discretized  equations  for  FVTD  listed  in  the  previous  section  can  only  be  used  for  "interior"  field  points  where  the 
electric  or  magnetic  vorticities  can  be  obtained  from  the  variables  in  the  computational  volume.  Near  the  boundary 
the  algorithms  have  to  be  modified.  Whatever  modification  used  is  governed  by  trying  to  make  maximum  use  of 
the  information  given  by  the  physical  boundary  conditions.  As  a  habit  of  computation  we  always  put  the  electric 
nodes  on  the  physical  boundary  and  the  magnetic  nodes  half  a  zone  away  from  the  boundary. 

In  the  following  we  present  two  approximate  boundary  condition  simulations  for  a  PEC  boundary  condition. 
These  are  listed  as  PEC  0  and  PEC  1  respectively.  Undoubtedly  there  exist  many  more  approximate  boundary 
condition  simulations.  They  are  all  very  similar.  PEC  0  was  introduced  in  our  previous  work  [2];  it  works  well 
for  smooth  surfaces.  PEC  1,  on  the  other  hand,  is  more  general  and  is  more  convenient  to  implement  It  has  also 
been  generalized  for  surfaces  with  edges  and  conical  tips.  More  discussion  on  boundary  condition  simulations  are 
included  in  our  work  [3,4]. 

PEC  Boundary  Condition  Simulation  0  (PEC  0) 

Shown  in  Fig.  8a  is  a  triangular  prism  representing  an  electric  element.  This  prism  has  a  triangular  base  1-4-8  at 
the  surface  of  the  scatterer.  In  the  updating  of  the  magnetic  field  at  the  centroid  of  this  element  we  need  the 

calculations  of  the  electric  vorticity  ( the  surface  integrals  ^hxEda  )  over  the  boundary  of  the  element.  We 
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show  how  to  approximate  the  electric  vector  at  the  center  ( point  d  )  of  the  lateral  surface  1-4-3-2.  Point  a  is  the 
midpoint  of  the  edge  1-2  and  point  b  is  the  midpoint  of  the  edge  3-4.  We  denote  the  unit  normal  to  the  triangle  14- 

8  by  ^  .  Surrounding  the  point  a  is  a  collection  of  magnetic  vertices  half  a  zone  from  the  scatterer.  In  these 
vertices  the  magnetic  vectors  are  located.  The  procedures  to  approximate  the  boundary  condition  are: 

1.  Make  use  of  the  magnetic  vector  in  this  collection  and  FDTD  ( this  collection  forms  a  dual  face)  to  obtain  the 

component  A  •  E(a) ;  where  ^  is  the  unit  vector  of  the  vector  area  of  the  dual  surface  surrounding  the  edge  1- 

2. 

2.  Since  £(1)— /zC/z*  £(1))  is  known  from  the  boundary  condition  and  E(2)  is  calculated,  we  have  the 
approximation 

E(a)=n(n  •  £(a))+y  |(£(2)-«(«  •  £(2)))+ (£(!)-«(«  •  £(1)))| . 

3.  Solve  for 

h  .  .  [(£(2)-n(n  .  £(2)))+ (£(!)-«(«  .  £(1)))}  ^ 


4.  Similarly,  solve  for  n*E{b)  . 

5.  Approximate  n*  E{d)=^[  h*  £(<3)+/z*  E(b)}  ;  this  is  the  component  of  the  electric  field  at  the 
normal  to  the  boundary  triangle. 


point  d 


6.  Also  approximate  E{dynin*  Eid)=^{{E(7)-h(n*  E(l))+(Ei6)-n(n^ £(6))}  ;  this  is  the 
projection  of  the  electric  field  at  the  point  d  in  the  plane  of  the  triangle. 

7.  Approximate  E{d)=n(n*  E(d))  +{S(d)-h{n*  Eid))},  making  use  of  the  approximation  in  step  5  and 
step  6. 

8.  The  electric  vorticity  from  the  lateral  face  1234  for  this  triangular  prism  is  approximated  by  ;  where 


A 


is  the  vector  area  of  the  lateral  surface  1432. 


element.  Point  a  is  the  midpoint  of  the 
the  updating  of  the  magnetic  field  at  the 

the  surface  integrals  jhxE  da)  over 

de 

the  boundary  of  the  element.  We  denote  the  unit  surface  normal  at  the  point  1  by  ^  .  Surrounding  the  edge  1-2 
is  a  collection  of  magnetic  vertices  half  a  zone  from  the  scatterer.  In  these  vertices  the  magnetic  vector  are  located. 
The  updating  of  the  magnetic  vector  at  the  centroid  of  the  prism  requires  the  electric  vorticity  over  its  boundary 
faces.  In  this  approximate  simulation  the  electric  vorticity  over  the  upper  and  lower  triangular  faces  is  calculated 
the  same  way  as  in  PEC  0.  We  deviate  from  PEC  0  in  the  calculation  of  the  electric  vorticity  over  the  lateral  faces. 
Instead  we  approximate  the  electric  vector  at  the  point  1 .  We  follow: 

1.  Make  use  of  the  magnetic  vector  in  surrounding  the  edge  1-2  and  FDTD  ( this  collection  forms  a  magnetic  face) 

to  obtain  the  component  A  •  E{a);  where  ^  is  the  unit  vector  of  the  vector  area  of  the  magnetic  surface 
suiTounding  the  edge  1-2. 

2.  Since  E(\)—h{n*  E{1))  is  known  from  the  boundary  condition  and  E(2)  is  calculated;  we  have  the 
approximation 

£(a)=n(n.£(a))+i{(£(2)-«(«»£(2)))+(£(l)-n(n«£(l)))|^ 


PEC  Boundary  Condition  Simulation  1  (PECl) 

Shown  in  Fig.  8b  is  an  edge  of  a  triangular  prism  representing  an  electric 
edge  1-2.  This  prism  has  a  triangular  base  at  the  surface  of  the  scatterer.  In 

centroid  of  this  element  we  need  the  calculations  of  the  electric  vorticity  ( 
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3.  Solve  for 


n*E(a)= 


14^ - j-ri  .  \[m)-n{h  .  £(2)))+ (£(!)-«(«  •  £(1)))| 

A»h  2A*n  'J' 


4.  Extrapolate  to  get  £(1) 

«•£(!)=  2n«£(£2)-;i»£(2). 


5.  Approximate  E(l)=n(n-E(m[E(l)-n(n> 

With  the  electric  vector  given  at  the  vertices,  the  calculation  of  the  electric  vorticity  over  the  lateral  faces  follows 
the  same  procedures  as  that  in  the  FVTD  algorithm. 


8.  A  StabUizing  Technique 

There  have  been  reports  of  late  time  growth  in  the  FDTD-like  codes  [15-18].  We  also  experienced  such  late  time 
growth  sometimes  in  our  FDTD/FVTD  hybrid.  This  can  be  a  serious  defect  It  has  prevented  meanin^ul 
calculations  for  a  long  period  of  time.  In  [1,4]  a  time  averaging  for  the  magnetic  vector  is  used  to  stabilize  the 
calculation.  We  discovered  that  a  a  spatial  averaging  of  the  magnetic  field  works  well  in  our  FDTD/FVTD  hybrid. 
From  the  way  our  rectangular  grid  overlaps  the  bodily  conformal  grid,  it  can  be  seen  that  calculations  can  be 
carried  out  if  at  each  time  step  we  pass  the  boundary  electric  field  data  from  one  grid  to  the  other.  Thus  at  each 
time  step  the  electric  vector  at  the  outer  boundary  of  the  curvilinear  grid  must  be  supplied  through  interpolation 
from  the  electric  field  calculated  in  the  rectangular  grid  by  the  FDTD  algorithm;  and  the  FDTD  rectangular  field 
component  at  the  inner  rectangular  grid  boundary  must  be  obtained  through  interpolation  of  the  electric  vector 
calculated  by  the  FVTD  algorithm.  The  magnetic  field  in  each  grid  can  be  obtained  through  the  regular  algorithm. 
We  discovered  that  the  procedures  discussed  above  produce  computation  with  a  late  time  growth!  The  growth  is 
not  a  strong  instability  and  it  always  happened  (for  the  small  objects  we  tried)  after  the  dominant  return  has  passed. 
We  heuristically  explain  the  instability  in  the  following: 

The  curvilinear  grid  with  the  inner  boundary  being  the  scattering  object  and  the  outer  boundary  being  the 
curvilinear  outer  boundary  form  a  "numerical  resonator"  very  much  like  a  physical  resonator  consisting  of  the  PEC 
boundary.  From  theory  we  know  that  there  exist  resonant  modes  that  satisfy  the  PEC  condition.  The  excitation  of 
these  resonant  modes  is  the  truncation  error  of  the  double  interpolation.  The  rectangular  FDTD  field  is  a  radiating 
field.  The  resonant  field  gets  artificially  radiated  through  the  interpolation.  We  therefore  thought  of  a  scheme  to 
"disrupt"  the  numerical  resonant  modes.  We  noticed  that  in  the  overlapping  region  there  will  be  curvilinear 
magnetic  field  locations  where  the  magnetic  vector,  in  addition  to  being  updated  by  the  curvilinear  FVTD 
algorithm,  can  be  approximated  through  interpolation  of  the  calculated  rectangular  components.  We  take  an 
average  of  the  calculated  (by  FVTD)  value  and  the  interpolated  (data  from  FDTD)  value.  For  our 
unstructured  curvilinear  grid,  it  is  more  convenient  to  "mix"  the  magnetic  field  than  to  "mix"  the  electric  field. 
This  late  growth  phenomenon  also  occurs  in  our  FDTD/FDTD  hybrid  algorithm  [1].  In  our  numerical  simulation 
for  a  finite  circular  cylinder  we  tried  several  types  of  "mixing"  and  found  that  averaging  the  calculated  and 
interpolated  magnetic  field  works  quite  satisfactorily.  When  we  observed  the  late  time  growth  in  our  FDTD/FVTD 
we  employed  the  same  "mixing"  as  in  the  FDTD/FDTD.  And  it  works!  Undoubtedly  there  are  other  legitimate 
approaches  one  might  try.  In  our  FDTD/FDTD  hybrid  [1]  we  carried  out  other  types  of  mixings. 

9  Sample  Calculations 

We  have  performed  two  sample  calculations  with  our  algorithms.  Example  1  is  a  PEC  sphere  and  example  2  is  a 
PEC  ellipsoid  of  revolution.  The  outer  boundary  of  the  rectangular  FDTD  grid  is  about  one  wavelength  from  the 
smallest  box  containing  the  scatterer.  In  the  outer  boundary  of  the  rectangular  FDTD  grid  we  use  our  tapered 
damping  [5]  for  the  sphere  calculations  and  for  the  ellipsoid  of  revolution  we  used  the  Mur  absorbing  boundary 
approximation.  The  incident  wave  is  a  linearly  polarized  plane  Gaussian  pulse.  The  far  field  scattered  time  signal 
is  Fourier  transformed  to  yield  the  frequency  response.  The  radius  of  the  sphere  is  1  meter.  We  present  two  sets  of 
calculations  based  on  the  number  of  surface  triangles  used.  Our  bodily  conformal  grid  consists  of  four  layers  of 
triangular  prisms.  This  body  grid  intersects  an  outer  rectangular  grid.  The  of  the  rectangular  grid  depends  on 
the  number  of  surface  triangles  used;  and  the  height  of  the  triangular  prisms,  Ar,  is  comparable  to  A  .x  .  For  a 
surface  grid  of  about  1200  triangles  with  the  lengths  of  the  uiangles  varying  from  .08  meters  to  .15  meters,  we 
chose  A;<  =0.10  meter,  and  Ar  =  0.1  meter  The  FDTD  inner  boundary  is  in  the  neighborhood  of  0.5 Ax  to  l.OAx 
from  the  sphere.  When  the  number  of  triangles  is  4800  (  four  limes  that  with  1200  triangles)  we  used  Ax=  .05m 
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and  Ar=.05m.  The  boundary  condition  simulation  is  PEC  1.  In  Figures  9a,  and  9b  we  show  the  RCS  calculations  ( 
for  two  different  numbers  of  surface  triangles)  and  compared  them  to  the  Mi e  series  calculation.  Also  included  in 
Figure  9a  is  the  staircase  calculations.  In  Figure  10  we  show  the  RCS  calculation  at  150  MHz  of  the  PEC  ellipsoid 
of  revolution.  The  boundary  condition  simulation  is  PEC  0.  This  calculations  are  compared  with  that  of  a  MOM 
BOR  code.  It  should  be  pointed  out  that  the  size  of  the  rectangular  cell  should  be  comparable  to  the  size  of  the 
surface  triangles.  Therefore,  as  the  number  of  surface  triangles  increases,  the  number  of  rectangular  cells  will  also 
increase.  For  the  same  Ax  the  computation  time  for  the  FDTD/FVTD  hybrid  is  roughly  four  times  that  required  for 
the  staircase  FDTD;  and  the  memory  requirement  can  easily  exceed  two  limes  that  for  the  FDTD  as  one  has  to 
store  the  interpolation  coefficients.  On  the  other  hand,  the  FDTD/FVTD  hybrid  is  much  more  accurate  than  the 
staircase  FDTD  as  illustrated  in  Figure  9a. 

10.  Conclusion 

We  have  presented  in  this  paper  a  time-domain  hybrid  FDTD/FVTD  scheme.  For  smooth  scatterers  the 
conformal  volume  grid  can  be  generated  with  a  surface  grid  for  which  a  number  of  commercial  grid  generators  are 
available.  Because  the  conformal  grid  is  only  a  few  layers  of  prisms,  it  is  rather  simple  to  generate  compared  to  a 
full-fledged  three  dimensional  grid  that  existing  generalized  TOTD  scheme  would  required.  We  believe  that  we 
have  introduced  a  new  technique  which  may  be  applicable  to  a  larger  number  of  boundary- value  problems  in  the 
time  domain  numerical  electromagnetics. 
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Radiation  boundary 


Near  field  to  far  field 
extrapolating  surface 


Inner  boundary  of  the 
rectangular  grid 


Curvilinear  body  grid  - 
Outer  boundary 


_ 


Fig.  5  The  Overlapping  Conformal  FDTD/FVTD  Grid 


Fig,  6  The  Outer  Boundary  of  the  Conformal  Grid  (a)  and  the 
Inner  Boundary  of  the  Rectangular  Grid  (b). 


462 


Fig.  7  Triangles  of  a  Prism. 


The  magnede  contour  for  updadng  a  component 
(along  £)  of  the  electric  field  at  the  point  a 


a 


Fig.  8  PEC  Boundary  Condition  Simulation  -  a.  for  PEC  0  and  b.  for  PEC  1 
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Reducing  the  Number  of  Time  Steps  Needed  for 
FDTD  Antenna  and  Microstrip  Calculations 

R.  J.  Luebbers  and  H.  S.  Langdon 
Electrical  Engineering  Department 
The  Pennsylvania  State  University 
University  Park,  PA  16802 


INTRODUCTION 

The  finite  difference  time  domain  (FDTD)  method  [1,2]  has  become  an  increasingly  popular  approach  for 
analyzing  the  electromagnetic  performance  of  antennas  and  microstrip  devices.  With  transient  excitation 
it  provides  impedance  and  scattering  parameters  over  a  wide  fi-equency  band  with  one  calculation  and 
application  of  the  fast  Fourier  transformation  (FFT).  One  difficulty  with  FDTD  is  that  tens  or  even 
hundreds  of  thousands  of  time  steps  may  be  required  for  the  transient  fields  to  decay. 

Previous  authors  have  used  different  approaches  to  try  to  reduce  the  number  of  required  time  steps.  In  [3] 
for  example,  the  transient  excitation  source  for  a  microstrip  line  is  located  near  the  FDTD  outer  boundary. 
After  the  transient  source  amplitude  has  fallen  to  zero,  the  source  is  removed  and  replaced  with  the  FDTD 
absorbing  boundary.  While  this  approach  reduced  the  number  of  time  steps  needed,  it  is  cumbersome  to 
apply  for  general  geometries.  It  also  requires  that  the  feed  location  be  far  enough  from  any  geometry 
features  so  that  no  reflections  return  to  the  feed  location  before  the  source  is  removed  and  the  outer 
absorbing  boundary  switched  on.  For  these  reasons,  this  approach  does  not  appear  to  be  applied  very  often 
in  the  literature. 

A  more  common  approach  is  to  apply  signal  processing  methods  to  predict  the  voltages  and  currents  at 
later  times  from  the  results  computed  for  early  times.  Instead  of  making  FDTD  calculations  for  the  lull 
30,000  time  steps  required  for  transients  to  dissipate,  one  might  make  actual  FDTD  calculations  for  8,000 
time  steps  and  use  these  results  to  predict  those  for  the  later  times.  A  number  of  papers  advocating 
variations  of  this  approach  have  appeared.  Two  recent  ones  are  [4,5],  and  there  are  a  number  of  earlier 
attempts  cited  in  these. 

Applying  the  various  prediction  methods  adds  additional  complexity  to  the  FDTD  calculation  process.  The 
prediction  methods  are  complicated,  and  may  require  care  and  skill  by  the  user  to  obtain  accurate  results. 
Most  of  the  methods  described  require  the  user  to  determine  the  order  of  the  prediction  process,  related 
to  the  number  of  terms  of  whatever  expansion  function  is  used  to  approximate  the  FDTD  time  signal.  A 
poor  choice  for  the  order  of  the  prediction  model  can  result  in  large  prediction  errors. 

In  this  paper  we  present  an  extremely  simple  approach  for  reducing  the  number  of  FDTD  time  steps  needed 
for  many  resonant  antenna  and  microstrip  calculations.  The  approach  is  based  on  using  a  source  with  an 
internal  resistance  to  excite  the  problem.  An  internal  source  resistance  was  used  in  [6]  for  excitation  of 
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microstrip  patch  antennas.  However,  no  illustration  of  the  advantages  of  using  and  internal  source 
resistance  for  reducing  FDTD  calculation  time  was  presented. 

In  the  following  sections  of  this  paper  the  approach  will  be  presented,  and  an  example  calculation  for  a 
microstrip  antenna  will  be  used  to  illustrate  the  advantages. 


RESISTIVE  SOURCE  FDTD  EXCITATION 

FDTD  antenna  or  microstrip  transient  calculations  are  typically  excited  by  a  ‘'hard”  voltage  source.  That 
is,  the  internal  source  resistance  is  zero  ohms.  These  source  are  very  easy  to  implement  in  an  FDTD  code. 
The  electric  field  at  the  mesh  edge  where  the  source  is  located  is  determined  by  some  function  of  time 
rather  than  by  the  FDTD  update  equations.  A  common  choice  is  a  Gaussian  pulse,  but  other  functions  may 
be  used.  The  Gaussian  pulse  is  significantly  greater  than  zero  amplitude  for  only  a  very  short  fraction  of 
the  total  computation  time,  especially  for  resonant  geometries  such  as  many  antennas  and  microstrip 
circuits.  Once  the  pulse  amplitude  drops  the  source  voltage  becomes  essentially  zero,  the  source  effectively 
becoming  a  short  circuit.  Thus  any  reflections  from  the  antenna  or  microstrip  circuit  which  return  to  the 
source  are  totally  reflected.  The  only  way  the  energy  introduced  into  the  calculation  space  can  be 
dissipated  is  through  radiation,  or  by  absorption  by  lossy  media  or  lumped  loads.  For  resonant  structures 
there  are  fi'equencies  for  which  this  radiation  or  absorption  process  requires  a  relatively  long  time  in  order 
to  dissipate  the  excitation  energy.  Using  a  source  with  an  internal  resistance  to  excite  the  FDTD  calculation 
provides  an  additional  loss  mechanism  for  the  calculations. 


Consider  that  it  is 
desired  to  excite  an 
FDTD  calculation  with 
a  voltage  source  that 
corresponds  to  an 
electric  field  E  in  the  z 
direction  at  a  certain 
mesh  location.  The 
correspondin  g 
equivalent  circuit  for  a 
voltage  source  which 
includes  an  internal 
source  resistance  Rs  is 
illustrated  in  Figure  1 . 
If  the  source  resistance 
Rs  is  set  to  0,  then  the 
usual  FDTD  electric 
field  at  the  source 
location  is  simply  given 
by  E-Vs/Az.  Vs  is  any 
function  of  time,  often 
a  Gaussian  pulse. 


Rs 


Figure  1  FDTD  source  with  source  resistance  Rs. 
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But  with  the  source  resistance  included,  the  calculation  of  the  source  field  E  at  each  time  step  is 
complicated  slightly.  In  order  to  determine  the  terminal  voltage  V  and  thus  the  FDTD  electric  source  field 
E,  the  current  through  the  source  must  be  determined.  This  can  be  done  by  taking  the  line  integral  of 
magnetic  field.  Example  Fortran  code  for  performing  the  required  calculation  for  a  Gaussian  pulse  is: 

IF  (T.LE.PRIOD)  THEN 
EZS(22, 15,4)=VS/DELZ* 

$  EXP(-ALPH*((T-BETADT)**2))+ 

$  ((HX(22,14,4)-HX(22,15,4))*DELX+ 

$  (HY(22,15,4)-HY(21,15,4))*DELY)* 

$  RS/DELZ 
ELSE 

EZS(22,15,4)- 

$  ((HX(22,14,4)-HX(22,15,4))*DELX+ 

$  (HY(22,15,4)-HY(21,15,4))*DELY)* 

$  RS/DELZ 
ENDIF 


For  this  example  RS  is  the  source  resistance  Rs,  EZS  is  the  Source  Electric  Field  in  the  z  direction,  in  this 
case  at  mesh  location  (22,15,4).  T  is  time,  and  PRIOD  is  the  time  duration  of  the  Gaussian  pulse,  outside 
of  which  it  is  approximated  as  zero  amplitude.  VS  is  the  Vs  internal  source  voltage,  DELZ  is  the  z 
dimension  mesh  size,  ALPH  and  BETADT  control  the  Gaussian  pulse  width,  and  HX  and  HY  are  the 
magnetic  fields 


surrounding  the  electric 
field  source  location.  If 
the  constant  RS  is  set  to  0 
in  the  example  Fortran, 
corresponding  to  Rs=0, 
then  the  usual  hard 
voltage  source  results. 
Clearly  it  is  quite  simple  to 
implement  a  voltage 
source  with  internal 
resistance. 


EXAMPLE  RESULTS 

To  illustrate  the  advantage 
of  using  a  source  with  an 
internal  resistance,  S^  vs 
frequency  for  a 
rectangular  microstrip 
antenna  over  a  dielectric 


Figure  2  Geometry  of  microstrip  patch  antenna  from  [3].  All  dimensions 


in  mm. 
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substrate  will  be 
calculated.  The  geometry 
is  similar  to  that  shown  in 
Fig.  3  of  [3],  The  only 
difference  is  that  the  50Ay 
long  feed  line  (2.46  mm 
wide  in  Figure  2)  is 
reduced  to  lOAy  in  the 
example  calculation  done 
here.  Thus  the  feed 
location  now  coincides 
with  the  reference  plane 
for  phase  in  [3],  This 
allows  the  FDTD  space  to 
be  smaller,  reducing 
calculation  time  and 
memory  requirements. 
The  microstrip  antenna  is 
located  on  a  0.794  mm 
thick  dielectric  substrate 
of  relative  permittivity  2.2 
over  a  ground  plane.  The 

cell  sizes  used  are  Ax=0.389mm,  Ay=0.40mm,  and  Az=0.265inni,  the  same  as  in  [3],  However  the  time 
step  used  is  6.407x10'”  seconds,  larger  than  used  in  [3],  The  problem  space  size  is  69x80x18  cells.  The 
antenna  is  fed  using  a  z- 
directed  electric  field  just 
above  the  ground  plane 
and  directly  below  the 
end  of  the  stripline  as 
marked  in  Figure  2.  This 
electric  field  feed  mesh 
location  is  then 
connected  to  the  end  of 
the  feed  line  by  additional 
z-directed  mesh  edges  of 
perfect  conductor.  The 
outer  boundary  is  second 
order  stabilized  Liao  [7 
,8  ]  except  for  the  x=0 
surface  which  is  the 
perfectly  conducting 
substrate.  All 

calculations  were  made 
with  the  XFDTD 
software  package  [9]. 


Rectangular  Microstrip  Antenna 


Figure  4  Source  current  Is  with  source  resistance  Rs-0,  128,000  time 
steps. 


Figure  3  Source  voltage  V  with  source  resistance  Rs=0.  Only  the  early 
time  steps  are  shown. 
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For  the  first  set  of 
calculations  the  source 
resistance  Rs  was  set  to 
0  Ohms,  which 
corresponds  to  the 
typical  FDTD  hard 
source.  The  source 
voltage  V  which 
determines  the  FDTD 
source  electric  field  vs 
time  is  shown  in  Figure 
3.  For  the  hard  source 
with  no  internal 
resistance,  this  is  of 
course  just  the  Gaussian 
pulse.  The 

corresponding  source 
current  Is  calculated  by 
FDTD  is  shown  in 
Figure  4.  This 
calculation  was  made  for 
128,000  time  steps,  and 

still  the  current  has  not  completely  dissipated. 


Rectangular  Microstrip  Antenna 


Figure  5  Source  voltage  V  with  source  resistance  Rs=50  Ohms.  All  4,000 
time  steps  are  shown. 


The  corresponding 
calculations  with  a  source 
resistance  Rs-50  Ohms 
are  shown  in  Figures  5 
and  6.  The  same 
Gaussian  pulse  Vs  was 
used  for  both  calculations. 
The  source  voltage  V 
shown  in  Figure  5, 
determined  by  saving  the 
source  electric  field  vs 
time,  is  no  longer  just  the 
Gaussian  pulse  since  the 
voltage  across  the  source 
resistance  Rs  is  also 
included.  The  source 
current  Is  converges 
much  faster  to  zero 
amplitude,  reducing  the 
FDTD  calculation  time  by 


Rectangular  Mlcrostrlp  Antenna 


Figure  6  Source  current  Is  with  source  resistance  Rs=50  Ohms.  All  4,000 
time  steps  are  shown. 
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a  factor  of  32.  Indeed,  it  appears  from  Figure  6  that  fewer  than  4,000  time  steps  would  be  sufficient. 

What  about  accuracy? 

Using  the  results  from 
both  FDTD  calculations 
shown  above,  the  source 
voltages  and  currents  are 
Fourier  transformed  using 
an  FFT  with  the  same 
number  of  terms.  The 
result  V  and  IS  results  for 
4,000  time  steps  were 
padded  with  zeroes  to  fill 
the  FFT.  Then  the 
resulting  complex 
voltages  and  currents 
were  divided  at  each 
frequency  to  determine 
the  input  impedance  at 
the  feed  location.  The 
stripline  characteristic 
impedance  Zq,  as  in  [3], 
was  taken  to  be  50  Ohms. 

Then  S„  -  (Z^  - 
+  Zo). 

The  results  obtained  for  both  calculations  are  shown  in  Figure  7.  The  agreement  between  the  two 
calculations  is  excellent,  with  the  4,000  and  128,000  time  step  curves  being  indistinguishable  for  most  of 
the  plot.  The  result  obtained  with  Rs==50  is  actually  more  accurate  at  the  lower  frequencies.  The  result  with 
RsK)  still  has  not  quite  converged  after  128,000  time  steps,  and  this  causes  the  ripple  in  the  corresponding 
low  frequency  results.  More  importantly,  the  result  with  the  50  Ohm  source  resistance  required  1/32 
of  the  computation  time  of  the  calculation  made  with  the  hard  source  with  Rs=0.  Both  results  agree  well 
with  the  measured  and  calculated  results  of  Sheen  et  al  [3],  which  are  also  shown  Figure  7.  Even  with  the 
longer  stripline  feed  and  removal  of  the  source,  the  FDTD  calculations  of  [3]  required  8,000  time  steps. 


Rectangular  Microstrip  Antenna 


Figure  7  Return  loss  parameter  for  both  FDTD  calculations  of  this 
paper,  compared  with  calculations  and  measurements  from  [3]. 


CONCLUSIONS 

A  simple  approach  for  reducing  the  number  of  time  steps  required  for  FDTD  calculations  for  resonant 
antennas  and  micro  strip  has  been  presented.  It  requires  no  additional  memory,  and  can  be  implemented 
with  a  few  lines  of  computer  code.  No  time-varying  calculation  parameters  or  complicated  post-processing 
is  required.  For  the  example  shown,  accurate  results  were  obtained  with  1/32  of  the  calculation  time 
compared  with  the  typical  FDTD  calculation. 
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I.  Introduction 

From  the  exact  Maxwell’s  equations,  solutions  have  been  obtained  that  are  similar  to  light  bullets  [1], 
i.e.  optical  pulses  that  are  self-supporting  under  the  effects  of  diffraction,  anomalous  dispersion  and  nonlinear 
refraction.  These  pulses  propagate  stably,  without  any  essential  changes  in  shape  or  spectral  content.  For 
comparison,  standard  theory,  which  uses  the  nonlinear  Schrodinger  equation  (NLSE),  (an  approximation  to 
Maxwell’s  equations),  predicts  [1]  that  under  the  effects  of  nonlinear  refraction,  self-focusing  will  lead  to 
the  collapse  of  optical  pulses.  Also,  additional  calculations  show  that  when  two  of  these  pulses  are  counter- 
propagating,  upon  interacting,  they  change  each  others’  trajectories.  These  pulses  are  extremely  small, 
approximately  25  fs  in  duration  and  contain  about  5  wavelengths. 

The  solutions  are  computed  by  solving  the  time  dependent,  vector,  nonlinear  Maxwell’s  equations  in 
two  space  dimensions  [2-5].  The  linear  dispersion  is  modeled  by  a  Lorentzian  model  with  a  single  resonance 
and  the  nonlinear  refraction  is  modeled  by  a  Kerr-like  instantaneous  nonlinearity.  The  optical  earner  is 
retained  in  the  calculations  so  that  each  wavelength  in  the  pulse  is  resolved.  Also,  since  the  Maxwell’s 
equations  are  solved  exactly,  all  higher  order  effects,  such  as  higher  order  dispersion,  are  automatically 
accounted  for  in  these  simulations. 

For  comparison,  standard  methods  of  analysis  and  numerical  simulation  use  the  NLSE.  That  a^ 
proach  solves  only  for  the  envelope  of  the  pulse.  It  assumes  that  the  envelope  is  slowly- varying  and  that  it 
contains  one  hundred  to  one  thousand  wavelengths  [6].  Also,  additional  terms  must  be  added  to  the  NLSE 
to  account  for  higher  order  effects  [7].  Hence  the  NLSE  cannot  be  used  for  the  simulation  of  the  small  pulses 
in  this  study.  Its  approximations  may  account  for  its  inability  to  predict  stable  light  bullets. 

The  numerical  algorithm  employed  in  this  paper  is  described  in  Refs.  2-5.  It  was  used  for  calculations 
of  propagating  and  interacting  temporal  [2-3,5]  and  spatial  [4,5]  solitons.  In  those  simulations,  the  nonlinear 
effects  included  Kerr-like  instantaneous  as  well  as  Raman  dispersive  nonlinearities.  Convolution  integrals 
described  the  linear  and  nonlinear  dispersive  effects.  Their  evolution  was  determined  by  a  coupled  system  of 
nonlinear,  ordinary  differential  equations.  These  equations  were  nonlinearly  coupled  to  Maxwell’s  equations 
through  the  electromagnetic  field.  This  entire  system  of  equations  was  solved  by  a  finite- difference  algorithm, 
as  described  in  Ref.  3.  With  that  development,  it  became  possible,  for  the  first  time,  to  solve  the  nonlinear, 
vector  Maxwell’s  equations  exactly,  without  having  to  go  use  approximations  such  as  the  NLSE.  In  this 
paper,  that  algorithm  is  used  under  the  simplification  that  the  nonlinearity  is  entirely  instantaneous. 

In  order  to  isolate  each  material  effect  on  the  optical  wave,  the  sequence  of  calculations  were  done 
methodically.  In  the  first  calculation,  only  dispersion  acted  on  a  pulsed  plane  wave.  Next  diffractive  effects 
on  a  beam  were  calculated.  These  two  effects  were  then  allowed  to  act  together  on  a  pulse.  Next  nonlinear 
refraction  was  introduced  as  the  third  effect,  with  the  result  that  a  stable  self-supporting  pulse  was  created. 
Finally,  two  of  these  pulses  interacted,  while  counter-propagating,  and  they  deflected  each  other. 

II.  Results 

The  results  are  shown  by  displaying  the  variation  of  the  electric  field  in  the  computational  domain. 
The  computational  domain  is  62  /im  along  the  direction  of  propagation  and  21  ;xm  in  the  transverse  direction. 

Figure  1  shows  the  results  of  dispersion  on  a  pulsed  plane  wave  at  four  instants  of  time  in  its 
propagation.  The  source  is  turned  on  at  the  left  side  of  the  domain.  The  results  are  shown  in  a  window  in 
the  computational  domain  that  is  centered  in  the  transverse  direction  and  that  is  13  pm  wide.  The  dispersion 
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in  the  material  is  characterized  by  a  single  Lorentzian  resonance  and  is  described  by  the  following  first  order 
susceptibility  function:  ~  ~  Here  Coo  =  9,  e,  =  21  and  =  6.0  x  10^'* 

rad/sec.  The  wave  length  of  the  plane  wave  is  A  =  1.55/im  (in  vacuum),  with  v  =  1.94  x  10^"^  Hz  or 
a;  =  1.22  x  10^®  rad/sec,  which  is  in  the  anomalous  dispersion  range.  The  electric  field  strength  is  2.9  x  10'® 
Volts/m.  The  pulse  has  a  hyperbolic  secant  envelope  with  a  characteristic  decay  time  constant  of  10.3 
fs.  Notice  that  as  the  wave  propagates,  the  pulse  lengthens  due  to  dispersion.  Each  band  represents  one 
wavelength. 

Figure  2  shows  the  results  of  diffraction  of  a  beam.  Again  the  source  is  turned  on  at  the  left  side 
and  the  four  plots  show  the  progression  of  the  beam,  including  the  beam’s  front  edge.  The  cross  sectional 
profile  of  the  beam  source  is  a  hyperbolic  secant  envelope  with  a  characteristic  length  of  0.517  /im,  which  is 
one  wavelength  of  the  optical  carrier  in  a  dielectric  with  an  index  of  refraction  equal  to  three.  As  the  beam 
propagates,  it  widens  due  to  diffraction. 

Figure  3  shows  the  effects  of  both  dispersion  and  diffraction  acting  on  a  pulse.  The  parameters  of 
dispersion  and  diffraction  are  those  from  the  previous  two  calculations.  The  plot  at  each  time  step  is  scaled 
by  the  maximum  and  minimum  values  of  the  electric  field  at  that  instant,  rather  than  using  global  values, 
which  would  wash  out  the  pulse’s  appearance  at  the  later  times  due  to  its  diminishing  amplitude.  Notice 
that  the  spreading  rates  of  the  pulse  due  to  dispersion  and  diffraction  are  approximately  equal.  Also  the 
individual  wavelengths  are  clearly  discernible  and  there  are  additional  wave  patterns  in  the  pulse. 

Figure  4  shows  the  pulse  propagating  under  the  combined  effects  of  anomalous  dispersion,  diffraction 
and  nonlinear  refraction.  The  strength  of  the  nonlinearity  is  given  by  the  nonlinear  index  coefficient  nj  = 
3.2  X  10“^®  m^/W,  (characteristic  of  silica  at  A  =  1.55  /im).  Here  the  length  of  the  computational  domain 
has  been  reduced  to  47  /tm  from  62  /im,  since  the  longer  length  is  no  longer  needed.  Notice  that  the  pulse 
is  now  self-supporting,  without  any  essential  changes  in  size  or  shape.  The  wavelengths  are  distinguishable 
and  there  are  about  five  wavelengths  in  the  pulse.  Around  the  core  of  the  pulse,  which  supports  itself,  is  a 
surrounding  low  level  outer  coat,  which  disperses  and  diffracts  away  from  the  core. 

Figure  5  shows  a  plot  of  the  electric  field  values  along  the  centerline  of  the  computational  domain  at 
the  four  times  shown  in  the  previous  figure.  Again  this  figure  shows  that  the  essential  shape  of  the  pulse  is 
maintained.  Figure  6  shows  the  temporal  power  spectrum  of  the  pulse  at  four  locations  along  the  centerline. 
Here  the  peak  of  the  spectrum  remains  essentially  unchanged.  Hence,  the  results  in  Figs.  4-6  show  that  the 
pulse  is  essentially  self-supporting  and  stable,  viz.  similar  to  a  light  bullet.  Also,  calculations  were  performed 
in  which  the  pulse  traveled  twice  the  distance  shown  in  Fig.  4  and  the  pulse  maintained  its  support. 

Figure  7  shows  that  when  two  of  these  pulses  are  counter-propagating,  upon  interacting,  they  change 
each  others’  trajectories.  Initially,  the  right  pulse  is  slightly  upward  in  comparison  to  the  left  pulse.  At  the 
third  instant  plotted,  the  pulses  have  just  passed  each  other  and  the  initially  higher  pulse  is  still  slightly 
higher.  By  the  four  instant,  the  plot  shows  that  the  two  pulses  have  become  aligned  vertically.  Although 
the  effect  is  small,  it  shows  that  one  of  these  light  bullet  like  pulses  can  alter  the  path  of  another  through 
an  interaction. 

The  calculations  were  performed  on  a  Cray  C-90  computer  and  took  180  seconds  per  case,  relatively 
minor  calculations.  The  first  author  thanks  Allen  Tafiove  of  Northwestern  University  for  many  helpful 
discussions. 
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Fig.  1.  The  electric  field  of  a  plane  wave  in  a  dispersive 
medium  after  155  fs,  310  fs,  465  fs  and  620  fs  of 
propagation. 


Fig.  4.  The  electric  field  of  a  pulse  undergoing  disper¬ 
sive,  diffraction  and  nonlinear  refraction  after 
155  fs,  310  fs,  465  fs  and  620  fs  of  propagation. 


-v-j. 


Pig.  2.  The  electric  field  of  a  beam  undergoing 
diffraction  after  155  fs,  310  fs,  465  fs 
and  620  fs  of  propagation. 


Fig.  5.  The  electric  field  amplitude  along  the  centerline 
of  the  computational  domain  after  155  fs,  310 
fs,  465  fs  and  620  fs  of  propagation. 


Fig.  6.  The  power  spectrum  of  the  pulse  along 
the  domain  centerline  and  located  at 
10  iim,  20  u,  30  u  and  40  u. 


Ultrawideband  Termination  of  Waveguiding  and  Multilayer  Structures 
for  FD-TD  Simulations  in  2-D  and  3-D 


Christopher  E.  Reuter*  ^  Rose  M.  Joseph^,  Eric  T.  Thiele^,  Daniel  S.  Katz^, 
Allen  Taflove^ 

iRome  Laboratory/ERST,  525  Brooks  Road,  Griffiss  AFB,  NY  13441-4505 
^Northwestern  University,  EECS  Department,  Evanston,  IL  60208 
^Cray  Research,  Inc.,  222  N.  Sepulveda  Blvd,  Ste.  1406,  El  Segundo,  CA  90245 
"^University  of  Colorado,  ECE  Department,  Boulder,  CO  80309 


Abstract 

We  describe  a  new  termination  for  FD-TD  models  of  metallic  and  dielectric  guided-wave 
structures  and  multilayer  structures  employing  the  perfectly  matched  layer  (PML)  absorbing 
boundary  condition  (ABC)  introduced  by  Berenger.  The  technique  is  accurate  over  a  wide  range 
of  group  velocities  and  requires  no  a  priori  knowledge  of  the  propagating  modes  or  dispersive 
properties  of  the  structure. 

Introduction 

Both  metallic  (perfect  electrically  conducting,  PEC)  and  dielectric  waveguiding  structures  are 
now  commonly  modeled  using  the  finite-difference  time-domain  (FD-TD)  method.  A  problem 
has  existed  in  the  termination  of  the  FD-TD  grid  for  the  case  where  the  structure  of  interest 
extends  beyond  the  bounds  of  the  model.  Since  the  propagating  energy  can  be  multimodal  and 
dispersive,  the  termination  must  absorb  energy  with  both  a  varying  group  velocity  and  varying 
transverse  distribution.  Previous  approaches  to  this  boundary  condition  problem  have  included 
one  way  wave  equations  [1],  error-cancelling  super-absorbers  [2],  outgoing  wave  annihilators 
[3],  and  the  Liao  theory  [4].  These  techniques  are  limited  to  a  narrow  spectrum  (allowing  the 
assumption  of  constant  group  velocity)  or  to  a  discrete  number  of  known  spectral  components  or 
modes  such  that  each  component  could  be  accounted  for  individually,  thus  requiring  a  priori 
knowledge  of  the  group  velocity  and  modal  distribution.  More  recently,  Moglie,  et.  al  [5] 
derived  an  analytical  waveguide  termination  by  Laplace  transformation  of  the  exact  expression 
for  the  group  velocity  within  the  waveguide.  This,  however,  is  computationally  burdensome 
requiring  the  evaluation  of  a  convolution  integral  for  each  mode  within  the  guide. 
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In  this  paper  we  describe  a  termination  for  FD-TD  simulation  of  guided  wave  structures 
employing  the  perfectly  matched  layer  (PML)  absorbing  boundary  condition  (ABC)  introduced 
by  Berenger[6].  Using  PML  absorbing  material  as  a  waveguide  termination  does  not  require  the 
propagating  field  energy  to  be  characterized  in  advance.  No  a  priori  knowledge  of  the 
propagating  modes  or  dispersive  properties  of  the  waveguiding  structure  is  required. 
Furthermore,  the  approach  is  local  in  time  and  space,  general,  and  extremely  accurate  over  a  wide 
range  of  group  velocities, 

PML  Waveguide  Termination 

The  Berenger  PML  ABC  is  a  new  approach  to  terminate  FD-TD  grids  using  a  numerical  analog 
of  a  nonphysical  material  impedance  matched  to  the  standard  FD-TD  grid  which  it  surrounds. 
Impedance  matching  between  the  FD-TD  grid  and  the  PML  is  attained  by  splitting  certain  field 
components  into  two  subcomponents.  Using  two  split  subcomponents  to  represent  one  regular 
FD-TD  field  component  gives  an  additional  degree  of  freedom  for  specification  of  material 
parameters.  Thus  it  is  possible  to  specify  a  material  in  which  waves  of  arbitrary  frequency  and 
angle  of  propagation  are  caused  to  decay  rapidly  while  maintaining  the  velocity  and  impedance 
of  the  numerical  media  from  which  they  originated.  Complete  details  of  the  PML  technique  are 
reported  [6,  7].  Application  of  the  PML  to  2-D  metallic  and  dielectric  waveguiding  structures  is 
reported  in  [8].  It  is  a  straightforward  extension  of  this  work  to  apply  the  PML  to  3-D  metallic 
and  dielectric  waveguides. 

It  is  to  desirable  obtain  the  minimum  possible  reflection  from  the  PML.  Three  parameters 
control  the  magnitude  of  the  numerical  reflections  at  grid-PML  interfaces;  1)  the  PML  thickness, 
2)  the  theoretical  reflection  at  normal  incidence,  R(0),  as  defined  by  Berenger,  and  3)  the  spatial 
profile  of  the  conductivity  within  the  PML.  Berenger  reported  limited  results  for  conductivity 
profiles  having  constant,  linear,  and  parabolic  distributions  [6].  Furthermore,  since  computer 
resources  are  limited,  the  above  three  parameters  must  be  optimized  to  meet  the  required  model 
fidelity  and  to  enable  solution  of  the  problem  within  limits  of  the  available  computer  system.  A 
thicker  PML  generally  reduces  reflections  but  requires  larger  memory  and  longer  CPU  run  time. 
Modification  of  the  remaining  two  parameters  has  insignificant  effect  on  the  required  memory 
and  run  time.  However,  the  relationship  between  these  two  parameters  and  the  magnitude  of  the 
reflection  is  not  obvious  and  warrants  investigation.  We  have  performed  numerical  experiments 
to  determine  the  optimal  values  for  these  parameters  while  limiting  computational  requirements. 
The  results  reported  for  2-D  and  3-D  waveguides  in  this  paper  were  obtained  using  the  optimized 
parameters. 

Results 

The  first  case  reported  is  that  of  an  an  air-filled  PEC  parallel-plate  waveguide  having  a  plate 
separation  of  40  mm  (fcutoff  =  GH-z).  Excitation  consists  of  an  83.3-ps  Gaussian  pulse 
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(FWHM)  modulating  a  7.5  GHz  carrier  that  launches  a  TM^  mode  towards  the  PML 
termination.  The  spectrum  of  the  excitation  and  the  normalized  group  velocity  for  the  TMj 
mode  are  shov^na  in  Fig.  1.  Note  that  the  input  pulse  has  significant  energy  in  the  region  of  high 
dispersion  on  the  group  velocity  curve.  The  reflection  coefficient  versus  frequency  is  obtained 
by  dividing  the  reflected  spectrum  by  the  incident  spectrum  as  observed  at  the  air-PML  interface. 
We  report  the  reflection  coefficient  of  the  PML  termination  for  two  sets  of  PML  parameters.  For 
the  first  set  we  used  an  8  cell  thick  PML  having  a  quadratic  conductivity  profile  and  an  R(0)  = 
10'^.  These  are  typical  PML  parameters  that  would  be  used  in  a  situation  where  memory  is 
limited.  The  second  type  of  PML  tested  has  a  cubic  conductivity  profile,  an  R(0)  =  1 0'^,  and  is 
32  cells  thick.  These  parameters  would  be  used  in  simulations  where  it  is  required  to  obtain  the 
absolute  lowest  possible  reflection  from  the  PML. 

Fig.  2  graphs  the  reflection  coefficient  versus  frequency  for  these  two  sets  of  PML  parameters. 
Reflections  better  than  -60  dB  are  obtained  throughout  the  entire  propagation  regime  for  the  first 
set  of  PML  parameters.  If  less  reflection  is  required,  the  32  cell  PML  could  be  implemented 
yielding  a  reflection  coefficient  of  better  than  -100  dB  for  the  majority  of  the  propagation  regime. 

The  next  case  reported  is  a  three-dimensional,  20  x  40  mm  PEC  waveguide  with  fcutoff  ^ 

GHz.  A  pulse  with  spectral  content  similar  to  that  of  the  2-D  case  above  was  used  to  excite  a 
TEio  mode  in  the  waveguide.  Again  we  report  two  PML  terminations  having  different 
properties.  The  first  PML  is  8  cells  thick,  has  a  fourth  order  conductivity  profile,  and  an  R(0)  = 
10'^.  The  second  PML  is  24  cells  thick,  has  a  fourth  order  conductivity  profile,  and  R(0)  =  10"^. 
Fig.  3  graphs  the  resulting  reflection  coefficients  versus  frequency  for  both  PML  terminations. 
The  reflection  coefficient  is  better  than  -60  dB  for  the  8  cell  PML  and  better  than  -80  dB  for  the 
24  cell  PML  throughout  most  of  the  propagation  regime.  Thus,  the  PML  can  be  tuned  to  obtain 
less  reflection. 

Reference  [8]  gave  results  for  PML  termination  of  a  2-D  dielectric  optical  slab  waveguide.  In  the 
oral  presentation  we  will  provide  additional  results  for  such  structures  with  optimized  PML 
parameters  and  also  extend  the  work  to  3-D  modeling  of  strip-lines.  We  will  also  present  results 
for  using  PML  to  improve  FD-TD  modeling  of  multi-layer  structures,  particularly  the  earth-air 
interface. 

Conclusion 

We  have  demonstrated  the  Berenger  PML  to  be  a  wideband  termination  for  FD-TD  simulations 
of  dielectric  and  PEC  waveguides  in  both  2-D  and  3-D.  It  has  advantages  over  previously 
reported  tecnniques  including;  1)  it  is  local  in  time  and  space,  2)  it  appears  to  be  effective  for 
dispersive,  multimodal,  and  evanescent  energy,  and  3)  it  is  easily  implemented  and 
computationally  efficient. 
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Reflection  Coefficient  (dB) 


Figure  1.  Normalized  excitation  spectrum  and  group  velocity  versus 
frequency  for  the  TM^  mode  in  the  2-D  PEC  parallel  plane 
waveguide  (fcutoff  “  GHz). 


Frequency  (GHz) 


Figure  2.  PML  reflection  coefficient  versus  frequency  observed  just 
outside  the  PML  layer  for  two  PML  terminations  having 
different  properties:  2-D  PEC  waveguide  test  case. 
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8  12  16  ^ 
Frequency  (GHz) 

PML  reflection  coefficient  versus  frequency  observed  just 
outside  the  PML  layer  two  PML  terminations  having  different 
properties:  3-D  PEC  waveguide  propagating  a  pulsed  TEio 

mode  with  fcutoff  ^ 
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ABSTRACT — A  modified  set  of  Maxwell’s  equations  is  presented  that  includes  complex 
coordinate  stretching  along  the  three  Cartesian  coordinates.  The  added  degrees  of  free¬ 
dom  in  the  modified  Maxwell’s  equations  allow  the  specification  of  absorbing  boundaries 
with  zero  reflection  at  all  angles  of  incidence  and  all  frequencies.  The  modified  equations 
are  also  related  to  the  perfectly  matched  layer  that  was  presented  recently  for  2-D  wave 
propagation.  We  have  demonstrated  that  absorbing  material  boundary  conditions  are  of 
particular  interest  for  finite  difference  time  domain  (FDTD)  computations  on  a  single- 
instruction  multiple-data  (SIMD)  massively  parallel  supercomputer. 

In  addition,  we  will  show  that  PML  will  absorb  static  field  and  can  be  used  to  truncate 
a  finite-difference  mesh  for  electrostatic  and  magnetostatic  simulations. 

1.  Introduction 

The  finite  difference  time  domain  method  [1,  2]  is  widely  regarded  as  one  of  the  most 
popular  computational  electromagnetics  algorithms.  Although  FDTD  is  conceptually  very 
simple  and  relatively  easy  to  program,  the  method  is  actually  quite  efficient  since  it  involves 
0{N^'^)  computational  complexity  in  2-D  and  computational  complexity  in  3-D 

[3].  In  fact,  FDTD  can  be  considered  an  optimal  algorithm  since  0{N^)  numbers  are 
produced  in  0(iV“)  operations. 

FDTD  is  also  ideally  suited  for  implementation  on  a  single-instruction  multiple-data 
(SIMD)  massively  parallel  computer.  The  reason  is  that  the  stencil  operations  that  must 
be  computed  at  each  node  of  the  space  grid  involve  only  nearest-neighbor  interactions  and 
may  be  implemented  at  a  minimum  communication  cost  [4].  A  major  challenge,  however, 
is  in  implementing  absorbing  boundary  conditions  (ABCs)  at  the  edges  of  the  FDTD 
grid.  On  scalar  and  vector  computers,  these  boundary  conditions  are  typically  computed 
using  methods  such  as  the  Engqmst-Majda  [5],  Mur  [6],  Liao  [7]  or  Higdon  [8]  ABC. 
However,  these  methods  are  not  ideal  for  parallel  supercomputers  since  they  all  involve 
communication  with  many  elements  normal  to  the  grid  boundary,  which  can  easily  surpass 
the  time  spent  computing  core  FDTD  operations  in  the  grid  interior.  Also,  they  do  not 
allow  for  SIMD  operation  on  a  parallel  machine  without  the  use  of  masking. 

An  alternate  method  of  implementing  an  ABC  is  to  use  a  conventional  absorbing 
material  boundary  [4,  9-14]  .  For  SIMD  parallel  computation,  these  methods  have  the 
advantage  that  the  ABC  may  be  implemented  with  the  same  FDTD  stencil  operation  as 
the  interior  nodes  by  modifying  the  conductivity  material  parameter  at  the  edge  of  the 
FDTD  grid.  The  disadvantage  is  that  the  reflection  coefficient  at  the  absorbing  border  is 
zero  only  at  normal  incidence  and  is  both  angle  and  frequency  dependent.  Consequently, 


t  This  work  is  supported  by  the  Office  of  Naval  Research  under  grant  N000-14-89-J1286,  the  Army 
Research  Office  under  contract  DAAL03-91-G-0339,  the  National  Science  Foundation  under  grant  NSF- 
ECS-92-24466,  and  NASA  under  grant  NASA-N AG-2-871.  Computer  time  is  provided  by  the  National 
Center  for  Supercomputer  Applications  at  the  University  of  Illinois,  Urbana- Champaign. 
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the  absorbing  material  border  region  must  be  made  quite  large — typically  20-100  grid 
points  along  each  edge  in  order  to  minimize  reflections. 

Recently,  Berenger  [15]  suggested  a  more  general  method  of  implementing  an  absorbing 
material  boimdary  condition.  Berenger  proposed  a  procedure  for  2-D  wave  propagation 
whereby  Maxwell’s  equations  are  generalized  and  added  degrees  of  freedom  are  introduced. 
The  added  degrees  of  freedom  allow  the  specification  of  absorbing  borders  with  zero  re¬ 
flection  coefficient  at  all  angles  of  incidence  and  aU  frequencies.  Moreover,  the  generalized 
Maxwell’s  equations  reduce  to  the  familiar  Maxwell’s  equations  as  a  special  case  and  hence 
the  same  generalized  equations  can  be  used  to  propagate  fields  in  both  the  interior  region 
and  absorbing  region.  Although  the  interface  between  the  interior  region  and  the  absorb¬ 
ing  boundary  is  reflectionless,  there  is  stfll  a  reflection  from  the  edge  of  the  grid.  The 
advantage  of  using  Berenger ’s  procedure  is  that  much  larger  conductivity  values  may  be 
specified  in  the  absorbing  region,  leading  to  a  drastic  reduction  in  the  number  of  grid 
points  required  for  the  absorbing  boimdary. 


A  formulation  similar  to  the  Berenger  idea  has  been  derived  for  3-D  wave  propagation 
from  first  principles  using  a  coordinate  stretching  approach  [16].  The  advantage  of  the 
new  method  is  for  SIMD  parallel  computation.  The  method  is  validated  with  3-D  FDTD 
numerical  computations  on  a  Thinking  Machines  Corporation  Connection  Machine  CM-5. 

2.  Modified  Maxwell’s  Equations 

For  a  general  medium,  we  define  the  modified  Maxwell’s  equations  in  the  frequency 
domain,  assuming  time  dependence,  as 

Ve  X  E  =  ioofiJI 

(1) 

Vh  X  =  —iu)eE 

(2) 

Vh'  eE  =  p 

(3) 

Ve  •  piH  =  0 

(4) 

where 

„  .Id  .Id  .Id 

(5) 

„  .Id  .18  .18 

^  ^ hsc  dx  ^ hy  dy  ^  ^ hz  dz' 

(6) 

In  the  above,  e^,  i  =  x,y,z  are  coordinate-stretching  variables  that  stretch  the  x^y^z 

coordinates  for  Ve  and  V^-  For  a  plane  wave,  we  can  show  that  for  a  homogeneous  medium, 
this  gives  the  dispersion  relation 

(7) 

where  K  =  +  y^  +  and  kh  =  +  z^,  or 

^  4-  -J—p  4-  -J— i-2 

(8) 

where  =  a;^/xe.  Equation  (8)  is  the  equation  of  an  ellipsoid  in 

[16] 

3-D  and  is  satisfied  by 

^xhx  sin  0  cos  (/>, 

(9) 

ky  =  f^yjeyhy  sm  9  sin <^, 

(10) 
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and 


kz  —  ^zhz^osB^  (11) 

Note  that  when  ft;,  i  =  x,y,z  are  complex,  the  wave  in  the  x,  y,  and  2  directions  are 
attenuative  and  can  be  independently  controlled.  Under  the  matching  condition,  6x  —  ha,, 
Cy  =  hy,  and  =  h^,  we  have  |fcep  =  \kh\‘^  =  The  wave  impedance  is  then  given  by 


n 

\H\  ue  Ve’ 


(12) 


irrespective  of  the  values  for  ei,i  —  x,y,z  and  the  direction  of  propagation. 

3.  Single  Interface  Problem 

Assume  that  a  plane  wave  is  obliquely  incident  on  the  interface  z  =  0.  Furthermore, 
we  may  assume  that  the  plane  wave  is  of  arbitrary  polarization.  The  incident  field  may  be 
decomposed  into  a  sum  of  two  components,  one  with  electric  field  transverse  to  z  (TE") 
and  the  other  with  magnetic  field  transverse  to  z  (TM""). 

We  can  derive  their  reflection  and  transmission  coefficients  as  [16] 


^1zC2zM2  ^2zC1zM1 

(13) 

h\z€-2zl^2  +  ^2zCizMl 

and 

^TE  _ 

2A:ize2zM2 

(14) 

klz^2zfj'2  +  h2z^lz^l 

Similarly,  we  have 

klzh2z^2  —  h2zh\z^X 
kizh2z^2  +  k2zhiz^l 

(15) 

and 

JtTM  _ 

2kizh2ze2 

(16) 

kizh2z^2  +  k2zh\z^\ 

4.  A  Perfectly  Matched  Interface 

The  phase  matching  condition  requires  that  =  k2x  and  kiy  =  k2y,  or 

Ki\Jeixhix  sin^i  cos</>i  =  K2 \/e2a:^2x  sin ^2  cos  (t)2  (1"^) 


Ki  yj eiyhiy  sin  0i  sin  (pi  =  K2yje2yh2y  sin  62  sin  4>2  (1^) 

where  ki  =  and  K2  =  For  a  perfectly  matched  medium,  we  choose 

€x  =  €2,  Ml  =  M2,  =  hx  and  Cy  =  hy.  Equations  (17)  and  (18)  become 

eia;Sm^iCos(/>i  =  e2a;Sin^2COS(^2  (1^) 


eiy  sin  sin  =  e2y  sin  O2  sin  4>2  •  (20 ) 

If  we  now  choose  eix  =  esx  and  eij,  =  then  0i  =  6»2,  <Pi  =  h  and  we  can  show  that 

both  =  0  and  =  0  for  all  angles  of  incidence  and  all  frequencies. 
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If  region  1  is  a  vacuum,  then  fi  =  ^  and 

^lac?  hjy,  hi^)  =  (l,  1, 1, 1, 1, 1).  (21) 


In  order  to  have  a  lossy  region  2  with  no  reflections  at  the  region  1/region  2  interface,  we 


{c^2x's  ^2yi  ^2z)  ^2y)  ^2z)  —  (1>  1;  Ij  Ij  ^2) 

(22) 

where  $2  is  a  complex  number.  In  this  case. 

klx  =  ^2x  =  Ko  sin  ^  COS  (jf) 

(23) 

kly  —  k2y  —  KoSmSsin(f> 

(24) 

kiz  —  kq  cosO 

(25) 

k2z  =  ^0^2  COS  0 

(26) 

where  kq  —  If  $2  =  52  +  252,  the  wave  will  attenuate  in  the  2  direction, 

of  interface  is  useful  for  building  material  ABCs  in  a  FDTD  simulation. 

This  kind 

In  realistic  simulation,  we  wiU  have  to  choose  52  Hence,  when  the  frequency  goes 

to  zero,  k2z  above  becomes  a  purely  imaginary  number.  Therefore,  it  attenuates  static 
fields  as  well. 

5.  Modified  Equations  in  the  Time  Domain 

For  the  general  case  of  a  matched  medium,  we  let  ex  —  =  Sx^  ey  =  hy  =  Sy  and 

=  hz  =  Sz,  Next,  we  let  =  1  +  iax/coe,  Sj,  =  1  +  iayfcje  and  Sz  =  1  +  iuzl^e.  Writing 
the  equations  in  the  time  domain  [16],  we  have 

dHs^  CTxfJ^  „  ^  « 

dt 

(27) 

^  dt  +  c 

(28) 

^  dt  +  e 

(29) 

®  dt  ^  ~ 

(30) 

dE,  d 

^  dt' = 

(31) 

dEs  „  a . 

^  dt  = 

(32) 

Equations  (27)  -  (32)  described  3-D  wave  propagation  in  a  perfectly  matched  medium. 
The  wave  propagation  phenomenon  described  by  these  equations  is  very  similar  to  that 
described  by  Maxwell’s  equations  with  the  exception  that  attenuation  may  be  controlled 
through  the  cry  and  (t^  variables.  The  FDTD  implementation  of  these  equations  on  a 
Yee  FDTD  grid  is  straightforward.  Absorbing  boundaries  at  the  edges  of  the  simulation 
region  may  be  created  by  choosing  appropriate  values  of  tTa,,  (7y  and  (j^.  Equations  (27)  - 
(32)  may  be  seen  to  include  Berenger’s  equations  [15]  as  a  subset  for  the  2-D  TE  or  TM 
case. 
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6.  Computer  Simulation  Results 

To  study  the  PML,  we  calculate  the  reflection  coefficient  for  a  plane  wave  incident 
with  different  values  of  The  reflection  coefficient  is  calculated  for  a  step  conductivity 
[cTz)  profile  of  7  layers  (with  a  metal  backing)  shown  in  Figure  la.  Figure  lb  shows  the 
reflection  coefficient  magnitude  as  a  function  of  fcx/^o  where  =  ko  corresponds  to  grazing 
incidence.  Notice  that  the  reflection  coefficient  magnitude  is  always  1  at  grazing  incidence. 
(Tmax  is  the  maximum  value  of  for  the  step  profile  which  increases  in  conductivity  toward 
the  metal  backing.  The  overall  reflection  amplitude  is  smaller  for  increasing  (Tmax>  Notice 
that  the  evanescent  wave  {kx  >  ko)  are  well  absorbed. 


Layer  ,  v 

Figure  1:  (a)  (Left)  The  step  profile  (parabolic)  of  the  conductivity  (cr^) 

used  in  the  PML  absorbing  boundary,  (b)  (Right)  The  reflection  coefficient 
magnitude  as  a  function  of  a  plane  wave  incident  with  different  kx's  {x  being 
the  direction  parallel  to  the  interface)  and  different  (Tmax- 


In  Figure  2,  we  solve  the  Sommerfeld  problem  of  a  line  source  over  a  PML  medium 
backed  by  a  ground  plane.  The  line  source  is  located  at  the  origin,  with  the  ffist  interface 
of  the  medium  at  a  distance  ht  below  the  line  source.  The  observation  point  is  located 
at  {x,z).  The  amplitude  of  the  field  is  plotted  as  a  function  of  x  for  a  fixed  2.  Notice 
that  as  (Tmax  increases,  the  result  agrees  with  the  ideal  exact  result, _  which  is  the  case  in 
the  absence  of  the  ground-plane-backed  PML.  Also,  for  large  x,  the  discrepancy  cannot  be 
remedied  by  choices  of  (Jmax-  This  is  because  the  observer  at  larger  x  receives  most  of  the 
reflected  field  through  grazing  reflection,  where  the  PML  medium  does  not  absorb  well. 


Figure  2:  The  field  magnitude  as  a  function  of  x  for  an  electric  line  source 

over  a  PML  medium  backed  by  a  ground  plane  for  different  (Tmax  in  a  seven 
layer  parabolic  profile.  The  thickness  of  each  PML  is  0.6. 
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In  order  to  demonstrate  the  new  method,  a  3-D  orthogonal  grid  FDTD  algorithm  was 
developed  based  on  Equations  (27)  -  (32).  The  FDTD  algorithm  was  implemented  as  a 
SIMD  code  on  the  Thinking  Machines  Corporation  Connection  Machine  CM-5.  The  al¬ 
gorithm  operates  very  efficiently  on  the  CM- 5  because  the  FDTD  stencil  operations  that 
need  to  be  computed  at  each  node  involve  only  nearest-neighbor  interactions.  The  com¬ 
munication  operations  resulting  from  the  nearest-neighbor  interactions  are  at  a  minimum 
cost. 

The  FDTD  solution  was  solved  in  a  cubic  region  of  dimension  (iVa:,iVy,  A^^)  = 
(128, 128, 32)  grid  points.  The  grid  parameters  chosen  were  Ax  =  Ay  =  Az  =  2.5  mm, 
At  =  4.5  ps  and  Nt  =  512  time  steps  were  computed. 

The  infinitesimal  electric  dipole  was  simulated  by  exciting  the  Ey  field  in  a  single  grid 
cell  with  the  source  pulse 


(33) 

where  r  =  l/47r/o  and  a  value  of  fo  =  1,0  GHz  was  chosen.  The  dipole  source  was  located 
at  grid  location  {n^yriy,  n^)  =  (91, 64, 16).  The  and  Ey  fields  were  obtained  by  sampling 
the  fields  at  grid  location  {nx,ny,nz)  =  (37,91,16). 

The  absorbing  boundaries  used  for  the  FDTD  simulation  consisted  of  planar  layers  of 
thickness  8  grid  points  on  all  surfaces.  Along  the  borders  parallel  to  x  axis,  the  value  of 
was  specified,  while  ay  and  az  were  specified  on  the  borders  parallel  to  the  y  and  z  axis, 
respectively.  The  conductivity  values  were  chosen  with  a  parabolic  taper. 

The  Ex  field  computed  using  both  the  analytic  formulation  and  the  FDTD  algorithm 
are  overlaid  in  Figure  3.  The  curves  due  to  the  analytic  and  numerical  solutions  are  barely 
distinguishable,  indicating  excellent  agreement. 
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Figure  3:  Analytic  and  numerical  FDTD  solution  overlaid  for  the  Ex  field 

resulting  from  an  infinitesimal  electric  dipole. 

The  CM- 5  machine  used  to  solve  the  FDTD  problem  is  located  at  the  National  Center 
for  Supercomputing  Applications  (NCSA)  at  the  University  of  Illinois.  The  program  was 
written  in  CM  Fortran  and  compiled  using  CMF  version  2.1.  The  CM-5  at  the  NCSA  has 
512  nodes  with  vector  units.  CPU  times  were  determined  by  running  the  problem  on  32, 
64,  128  and  256  node  partitions.  For  this  problem,  a  total  of  1.5  million  unknown  field 
quantities  per  time  step 

(128  X  128  X  32  grid)  were  determined  for  512  time  steps.  The  CPU  times  are  shown 
in  Table  1. 

In  Figure  4,  we  show  the  absorption  of  electrostatic  field  by  a  PML  medium.  The  plots 
are  for  the  a;-components  of  the  electric  field  as  a  function  of  x  due  to  a  line  of  electric 
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Table  1:  CPU  times  for  FDTD  Problem  on  CM-5 


Nodes 

CPU  sec  (Run  1,  Run  2,  Run  3,  Avg.) 

32 

50.5,  50.2,  50.6;  50,4 

64 

29,9,  30.0,  30,0;  30.0 

128 

17.9,  18.4,  18.4;  18.2 

256 

12.4, 13.2, 12.7;  12.8 

dipole  pointing  in  the  rr-direction.  The  dotted  line  is  the  closed-form  solution.  The  dotted- 
dashed  line  is  the  simulation  with  PML  absorbing  boundary  condition.  The  solid  line  is 
the  result  from  simulation  using  a  modified  impedance  boundary  condition.  It  is  seen  that 
the  static  field  diminishes  exponentially  fast  near  the  edge  of  the  simulation  region.  In 
the  simulation  region  of  interest,  the  PML  result  agrees  much  better  than  the  impedance 
boundary  condition  simulation  when  compared  to  the  closed  form  solution. 


grid 

Figure  4:  Comparison  of  PML  simulation  (dotted- dashed),  impedance 

boundary  condition  simulation  (solid)  and  closed  form  (dotted)  for  the  x 
component  of  the  electric  field  due  to  a  line  source  of  static  electric  dipole. 

7.  Conclusions  ,  ,  .  i  j.  ^ 

A  modified  set  of  Maxwell’s  equations  have  been  introduced  using  complex  coordinate 
stretching  factors  along  the  three  Cartesian  coordinate  axis.  This  modification  introduces 
additional  degrees  of  freedom  in  Maxwell’s  equations  such  that  absorbing  boundaries  may 
be  specified  with  zero  reflection  coefficient  at  aU  frequencies  and  all  angles  of  incidence. 
The  formulation  was  shown  to  be  related  to  the  perfectly  matched  layer  that  was  recently 
derived  by  Berenger  for  2-D  wave  propagation.  A  3-D  FDTD  algorithm  was  developed 
from  the  modified  Maxwell’s  equations  that  uses  the  refiectionless  absorbing  interface  prop¬ 
erty  to  implement  radiation  boundary  conditions  at  the  edges  of  the  FDTD  grid,  and  was 
implsmented  on  the  Connection  Machine  CM-5  and  timing  results  were  presented.  This 
breakthrough  in  absorbing  material  boundary  conditions  allows  EM  scattering  to  be  com¬ 
puted  very  efficiently  on  SIMD  parallel  computers.  In  addition,  we  show  that  PML  does 
not  absorb  for  waves  at  grazing  incidence,  but  it  could  absorb  static  electromagnetic  field. 
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Abstract 


A  new  type  of  “Perfectly  Matched  Layer”  (PML)  is  developed  for  mesh  truncation  m 
finite  element^pplications.  The  concept  is  based  on  using  diagonaUy  anisotropic  media  in  an  artificial  absorber^ 
The  interface  between  the  absorber  and  the  computation  domain  can  be  made  refiectionless  for  “8  ' 
incidence  and  polarization,  leading  to  a  mesh  truncation  scheme  with  much  less  reflection  than 
absorbing  boundary  conditions.  The  scheme  was  implemented  in  a  3-D  tetrahedral  finite  element  code  and 
appUed  to  solving  a  TEM  waveguide  problem  and  a  sphere  scattering  problem. 


1  Introduction 


Traditionally,  the  truncation  of  finite  element  solution  domains  has  been 

local  or  global  boundary  operators  to  the  outer  surface  of  the  finite  element  mesh.  There  is  a  tradeoff  involved 
between  the  accuracy  of  global  operators  (boundary  integral  solutions)  and  the  efficiency  of  local  operators  (ABC 
Alternative  methods  of  truncation  based  on  placing  a  layer  of  absorbing  material  at  the  ^-^ary  of  ^ 

solution  domain  have  been  widely  investigated  in  the  past.  These  absorbers  posses  “ 

local  boundary  operators:  they  preserve  the  sparse  structure  of  the  system  matrix  generated  by  the  finite  elemen 
method.  Unfortunately,  their  accuracy  is  also  similar  to  that  of  local  boundary  operators. 


Recently  J  P  Berenger  [1]  introduced  a  high  performance  absorbing  layer  for  2-D  FDTD  simulations  by  splitting 
“id  component  ^  two  subcomponits  (id.  =  id.  +  id.).  This  “Perfectly  Matched  I.ay«”  £ML 
approach  yields  a  refiectionless  interface  between  free  space  and  the  artificial  absorber  Chew  and  W«don  [2] 
dL^cribed  an  alternative  method  of  deriving  a  perfectly  matched  layer  based  on  complex  coordinate  stretching 
along  the  cartesian  axes.  Both  methods  involve  generalizations  of  Maxwell’s  equations  which  introduce  a 
degrees  of  freedom.  However,  neither  of  these  approaches  is  well  suited  for  finite  element  implementations  of  a 

perfectly  matched  layer. 


Fortunately,  a  refiectionless  interface  between  free  space  and  a  lossy  materml  absorber  can 

the  bulk  properties  of  the  material  are  anisotropic.  Specifically,  if  e  and  p  are  appropriately  chosen  complex 
diagonal  tensors  the  impedance  of  the  medium  will  be  independent  of  the  frequency,  polarization,  and  inciden 
Igle  of  tL  wave  at  the  interface.  A  perfectly  matched  layer  of  this  anisotropic  material  can  be  used  for  truncation 
of  finite  element  solution  domains. 


In  this  work  an  anisotropic  absorber  is  presented  and  implemented  to  truncate  FEM  solution  domains  m  3-D.  The 
accuracy  of  the  anisotropic  layer  is  examined  by  solving  a  guided  TEM  wave  propagation  problem  and  computing 
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the  electromagnetic  scattering  from  a  plane  wave  incident  on  a  conducting  sphere. 

2  Anisotropic  Absorbers 

The  development  of  a  perfectly  matched  absorber  using  anisotropic  material  properties  is  briefly  described  in  this 
section.  A  more  detailed  derivation  can  be  found  in  [?].  The  material  properties  of  the  absorbing  medium  are 
assumed  to  be  complex  diagonal  tensors:  e,  /x.  To  match  the  wave  impedance  of  the  absorber  to  that  of  free  space, 
the  following  condition  is  required: 


Thus,  p,  =  Hoj  and  e  =  cof  where 


■  a  0  0 

j  =  0  6  0 

0  0  c 


a,  6,  and  c  are  complex  numbers.  When  a  =  6  =  c  =  1,  the  medium  reduces  to  free  space.  It  can  be  shown 
that  plane  waves  are  eigensolutions  of  Maxwell’s  equations  in  the  diagonally  anisotropic  media  and  the  plane  wave 
propagation  vector  k  is  governed  by  the  following  dispersion  relation: 


i-2  p  l2 

_L  ^  4.  -  i.2 

_  j - ^  ^  ^ 

be  ac  ab 


The  solution  of  (3)  is  an  ellipsoid: 


kx  =  jboV^sin  9  cos  <f> 

ky  =  jkov^sin^sin^ 

kz  =  koV^cosO  (4) 

Next,  the  reflectivity  of  a  planar  interface  between  two  diagonally  anisotropic  media  is  examined.  In  order  to 
determine  the  reflection  and  transmission  characteristics  of  the  interface,  consider  the  problem  of  a  plane  wave 
produced  by  a  source  at  z  =  —  oo  in  region  1  incident  upon  the  interface  at  z  =  0.  The  geometry  of  the  problem 
is  shown  in  Figure  1 . 

The  propagation  vector  k  is  confined  to  the  xz  plane,  so  the  general  solution  to  the  dispersion  relation  (3)  is: 


kx  =  koy/^sinO 

=  0 


ko\/^cos  9 


In  each  region,  any  arbitrary  plane  wave  whose  direction  is  given  by  (5)  can  be  decomposed  into  a  linear  combination 
of  TE  and  TM  polarizations.  The  E  and  H  fields  for  the  case  of  TE  polarization  are: 
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Figure  1:  Plane  wave  incidence  on  an  interface  between  two  diagonally  anisotropic  half-spaces. 


Ei{x,  z) 
Bt{x^  z) 
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^TE  f_. /¥  cos  @,X  +  ,/5  sine, :^e-i‘o(v^sin9.r+v/S^Uco,»..) 

\  \  02  V  C2  y  T/0 


(6) 


Similarly,  the  E  and  H  fields  for  the  case  of  TM  polarization  are: 


Ei{x,  z) 


9iz) 


Er{x,z)  = 


\/  —  cos  9rX 

V  fll 


£"06 


— jfcoCV^TcT sin  0r2r+'s/Ol  6l  cos  ^rS) 


Et{x,z) 


Eoe 


-jifco(V'^’3C2  5in  ^fr+->/a2^’2  cos^<z) 


i?r(x,z) 

H,ix,z) 


Eq  -  ~jko(\/biCi&in  d^r+v'cifti  cos 

»/0 

v'feiCi  sin  ^ra7+\/ai6i  cos  Or-s) 
j-rM  -^0  ^^-iko(y/b2Cys'm  SiX+y/a^cosetz) 

Vo 


(7) 
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Now,  the  reflection  coefflcient  for  TE  and  TM  polarizations  can  bejdeterniined  by  enforcing  boundary  conditions 
at  the  interface.  In  the  TE  case,  enforcing  tangential  continuity  of  E  and  H  at  the  interface  requires  matching  the 
magnitude  and  phase  of  the  Ey  and  Hx  components  of  the  fields  in  both  regions  at  z  =  0.  Matching  the  Ex  and 
Hy  components  is  required  in  the  TM  case.  Magnitude  matching  in  both  TE  and  TM  cases  results  in  the  following 
relations: 


1  + 

1  -j- —  'j'TM 


Phase  matching  results  in: 
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Equations  (12)  and  (13)  reduce  to  similar  forms.  These  result  can  be  used  to  create  a  perfecUy  matched  absorbing 
medium  which  will  damp  any  incident  wave  radiating  from  the  adjacent  free  space  region  without  reflecting 
radiation  back  from  the  interface.  Equation  (14)  indicates  that  only  one  complex  number  is  needed  to  characterize 
the  absorbing  region.  For  example,  the  diagonal  tensor  7  required  in  the  absorbing  region  associated  with  an 
interface  in  the  xy  plane  is; 


a  0  0 

0  a  0 

0  0- 
a 


(15) 


If  o  is  a  complex  number  of  the  form  a  =  a-  j0,  where  o  and  0  are  positive  real  numbers,  the  transmitted  wave 
will  be  damped: 


The  parameter  o  determines  the  wavelength  of  the  transmitted  wave  in  the  absorbing  region  and  the  parameter 
p  determines  the  rate  of  attenuation  in  the  absorbing  region.  It  should  be  noted  that  an  absorber  of  this  form  is 
not  a  passive  medium  since  the  z  component  of  electric  and  magnetic  conductivity  tensors  is  negative. 

For  practical  purposes,  the  absorbing  region  must  have  a  finite  thickness.  Therefore,  the  termination  of  the 
absorbing  region  results  in  reflection  of  energy  back  into  the  free  space  region.  The  overall  reflection  coefficient 
R{d)  of  a  finite  thickness  absorber  with  termination  is  given  by: 


ii?(^)|  =  ii?T(0)|e'2/3iocos^d  (17) 

where  Rt{0)  is  the  reflection  coefficient  of  the  termination.  If  the  absorber  is  terminated  by  a  perfect  electric 
conductor,  |iilT(^)|  =  1- 


3  Finite  Element  Implementation 

Finite  element  implementation  of  the  anisotropic  absorber  in  the  frequency  domain  is  straightforward.  In  this 
work  a  tetrahedral  vector  finite  element  code  was  modified  to  accommodate  the  diagonal  tensor  material  properties 
requiUd  for  implementation  of  the  absorber.  The  method  is  based_on  an  E  field  formulation  using  Whitney  1- 
Forms  (edge  elements)  as  a  basis  for  the  test  and  trial  functions  (£  and  W).  It  is  convenient  to  terminate  the 
absorbing  regions  with  a  metal  backing,  since  the  surface  integral  term  in  the  weak  form  of  the  wave  equation 
vanishes.  The  functional  (weak  form)  becomes: 


=  J  xW)-{[pir]~^V  X  E)- klW  ■l€r]E^dV  (18) 

The  Galerkin  testing  procedure  results  in  a  complex,  symmetric  linear  system: 


(S-JfcgT)e  =  f  (19) 

Sij  -  I  {V  XWi)-  X  Wj)dV 

Jn 

T,-,-  =  [  Wi  •  frWjdV  (20) 

Jn 
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Wi,  Wj  are  edge  element  basis  functions.  The  most  efficient  solver  for  (19)  appears  to  be  the  Bi-Conjugate  Gradient 
method  with  diagonal  scaling. 


4  Numerical  Results 


Numerical  studies  were  conducted  using  two  example  problems.  TEM  wave  propagation  in  a  simple  parallel  plate 
waveguide  was  considered  as  well  as  the  electromagnetic  scattering  from  a  conducting  sphere.  The  geometry  of 
the  example  problems  is  shown  in  Figure  2  below. 


PEC 

truncation 


PECbKkii^ 


Figure  2:  Geometry  of  TEM  and  sphere  scattering  examples 


4.1  TEM  waveguide 

The  side  walls  of  the  structure  are  perfectly  conducting.  The  structure  is  terminated  by  a  metal  backed  absorbing 
layer.  A  TEM  wave  is  excited  at  the  opposite  end.  Two  example  results  for  this  problem  are  shown  in  the  next 
figure  and  compared  to  the  exact  solutions. 

The  TEM  waveguide  problem  revealed  a  drawback  of  using  the  anisotropic  absorber  for  mesh  truncation.  Specif¬ 
ically,  the  matrix  condition  number  grew  as  the  parameter  P  was  increased.  This  led  to  unacceptable  rates  of 
convergence  in  the  iterative  matrix  solver.  Further  investigation  revealed  that  the  condition  number  is  sensitive 
to  the  value  of  a  as  well  as  (3.  Based  on  a  convergence  study  using  the  TEM  waveguide  problem,  a  near  optimal 
choice  of  ot  appears  to  be  o  —  /?. 

4.2  Scattering  by  a  conducting  sphere 

Scattering  from  a  perfectly  conducting  sphere  of  radius  0.4A  was  used  to  characterize  the  performance  of  the  ab¬ 
sorber.  The  total  electric  field  present  in  the  problem  domain  is  the  sum  of  incident  and  scattered  field  components: 
E*  =  E*  E\  The  incident  field  is  a  plane  wave  traveling  in  the  +z  direction,  polarized  in  the  x  direction.  The 
scattered  electric  field  is  used  as  the  unknown  quantity  in  the  FEM  field  computation  instead  of  the  total 
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Sample  result  for  1D  example 


Figure  3:  Result  and  exact  solutions  for  /?  =  1.0  and  =  1.5 


electric  field.  The  excitation  due  to  the  incident  plane  wave  is  incorporated  via  the  boundary  condition  on  the 
surface  of  the  sphere,  where  =  —E\. 

For  this  problem,  the  Je  component  of  the  surface  current  on  the  sphere  was  compared  to  the  exact  series  solution. 
The  results  were  obtained  using  meshes  with  a  density  of  approximately  6-  elements  /  wavelength.  The  performance 
of  the  absorber  was  characterized  by  studying  the  effect  of  several  parameters  on  the  accuracy  of  the  surface  current 
calculation. 


First  the  effect  of  the  parameter  (3  on  the  solution  accuracy  was  studied.  For  this  comparison,  the  thickness  of  the 
absorber  was  0.3A  and  the  absorber  was  placed  0.3A  away  from  the  surface  of  the  sphere.  /?  was  varied  between 
0.5  and  3.0.  There  was  no  additional  accuracy  to  be  gained  by  using  a  value  of  /?  greater  than  3.0  since  the  rate  of 
damping  was  limited  by  the  coarse  mesh  in  the  absorbing  region.  In  this  range  for  the  reflection  coefficient  for 
normal  incidence  varies  from  0.15  to  1.2  x  10'^  The  surface  current  distribution  for  varying  values  of  is  shown 


in  Figure  4. 


Accuracy  vs.  Beta 


Figure  4:  Je  component  of  the  surface  current 

Next,  the  solution  accuracy  was  examined  as  the  absorber  thickness  was  varied.  In  this  study,  a  ^  value  of  1.5 
was  used  and  the  absorber  was  again  placed  0.3 A  away  from  the  sphere.  The  thickness  of  the  absorber  was  varied 
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from  0.1  A  to  0.5A,  resulting  in  reflection  coefficients  in  the  range  from  0.15  to  8.1  x  10  ®  for  normal  incidence.  The 
resulting  current  distributions  are  plotted  in  Figure  5. 

Accuracy  vs.  absorber  thickness 


theta 


Figure  5:  Sphere  scattering  solution  for  varying  absorber  width 


5  Conclusions 

A  new  method  of  mesh  truncation  for  frequency  domain  finite  element  analysis  was  developed.  The  method  is  based 
on  using  an  artificial  anisotropic  absorber  whose  material  properties  are  complex  diagonal  tensors.  If  the  tensors 
are  chosen  appropriately,  the  interface  between  the  computational  domain  and  the  absorber  will  be  reflectionless 
for  all  angles  of  incidence.  This  approach  offers  the  potential  for  much  lower  reflection  coefficients  than  other  local 
absorbing  boundary  conditions. 

The  new  method  of  mesh  truncation  was  implemented  and  verified  using  two  example  problems.  The  prelimi¬ 
nary  results  appear  promising.  Reasonable  performance  can  be  achieved  with  relatively  thin  absorbing  layers  (as 
little  as  two  elements  thick).  Also,  because  there  is  no  surface  integral  term  in  the  metal  backed  implementation, 
the  method  is  easy  to  implement  and  is  well  suited  for  parallel  computations.  The  drawback  of  this  method  is 
that  the  resulting  linear  systems  are  ill-conditioned,  which  sometimes  causes  convergence  problems  with  iterative 
solvers. 
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Abstract 

A  modification  is  proposed  to  Berenger’s  perfectly  matched  layer  (PML)  numerical  grid  trun¬ 
cation  method  to  provide  for  efficient  absorption  of  surface  waves  in  the  direction  transverse  to  the 
interface  that  supports  them.  Analytic  results  from  wave  propagation  in  dielectric  slab  waveguides 
and  numerical  results  from  the  dispersion  analysis  of  a  microstrip  line  using  the  finite- difference 
time-domcdn  method  are  used  to  show  that,  unless  this  modification  is  effected,  the  PML  layer  can¬ 
not  be  brought  arbitrarily  close  to  the  interface  that  supports  the  surface  wave  without  affecting 
the  propagation  characteristics  of  the  wave. 


1.  Introduction 

Berenger’s  perfectly  matched  layer  (PML)  approach  to  the  truncation  of  FDTD  grids  has  been 
used  successfully  in  conjunction  with  electromagnetic  radiation  and  scattering  problems  in  two  and 
three  dimensions  [1-3].  An  alternative  approach  to  the  construction  of  the  PMLs,  directly  from 
Maxwell’s  equations  using  streched  coordinates,  was  presented  also  in  [4].  However,  up  to  this 
point  there  has  been  no  detail  investigation  of  the  performance  of  PMLs  with  regards  to  absorption 
of  surface  waves  of  the  type  encountered  in  layered  media  and  waves  with  evanescent  behavior 
transverse  to  their  direction  of  propagation  tytpically  encountered  in  practice  in  conjunction  with 
various  types  of  planar  waveguiding  structures.  Except  from  a  brief  discussion  in  [3],  there  has  been 
no  careful  investigation  of  the  effect  that  a  PML  layer,  positioned  parallel  to  the  media  interface 
that  supports  a  surface  wave,  might  have  on  the  propagation  characteristics  of  the  wave,  especially 
when  the  layer  is  brought  close  to  the  interface. 

Results  in  the  literature  from  FDTD  simulations  of  radiation  and  scattering  problems  indicate 
that  the  PMLs  can  be  brought  very  close  (as  close  as  2  FDTD  cells)  to  the  radiator/scatterer.  Such 
a  capability  is  highly  desirable  for  the  FDTD  analysis  of  integrated  microwave,  millimeter  wave  and 
optical  circuits,  in  order  to  keep  computer  memory  requirements  at  a  minimum  without  sacrificing 
the  numerical  accuracy  of  the  simulations.  Such  circuits  include  a  variety  of  resonant  structures, 
the  electromagnetic  behavior  of  which  is  dependent  on  the  dispersive  propagation  characteristics  of 
the  waves  supported  by  the  waveguiding  sections  that  comprise  them.  Cosequently,  it  is  important 
to  investigate  and  comprehend  the  impact  of  PMLs  on  the  dispersive  characteristics  of  planar 
waveguiding  structures  before  any  circuit-level  FDTD  simulations  using  PMLs  can  be  performed 
with  confidence. 
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A  first  attempt  to  this  investigation  is  presented  in  this  paper.  Through  an  analytic  study  of 
the  eigenvalue  problem  of  wave  propagation  in  a  dielectric  slab  waveguide  with  PMLs  present  on 
either  side  of  the  slab,  we  demonstrate  that  Berenger’s  PML  disturbs  the  efective  index  of  the 
propagating  wave.  Furthermore,  we  introduce  a  modification  to  the  original  PML  that  is  shown  to 
alleviate  this  difficulty.  The  numerical  implementation  of  the  modified  PML  is  presented  also. 


2.  Theory 


Using  the  stretched  coordinate  approach  proposed  in  [4],  Maxwell’s  curl  equations  are  written 
as 


(1) 

=  V.xf  (2) 


where 


V. 


^  $xdx  ^  Sy  dy  ^  Sz  dz 


(3) 


We  restrict  our  attention  to  the  two-dimensional  case  with  d/dy  =  (i.  As  it  is  well  known,  Maxwell’s 
equations  decouple  into  two  independent  sets,  one  involving  the  field  components  ify,  Ex,  and  Eg 
(TM  polarization),  and  one  involving  the  field  components  Eyy  Hx,  Hz  (TE  polarization).  The 
pertinent  equations  for  the  TM  polarization  are 


-  ^  ^  H 

1  d 

(4) 

=  -lAjy 

dz  ^ 

(5) 

(6) 

(7) 

where  the  split- component  formalism  used  with  Berenger’s  PML  condition,  Hy^,  Ex^  and 
has  been  used.  In  order  to  derive  the  wave  equation  for  the  TM  polarization,  we  substitute  Eq.  (4) 
in  Eq.  (6)  and  Eq.  (5)  in  Eq.  (7),  giving 


lAlAjy 

Sx  dx  Sx  dx  ^ 

(8) 

-u^lieHy,  = 

$z  dz  Sz  dz  ^ 

(9) 

Since  {Hy^  4-  Hy^)  =  Hy  we  obtain 

2  1^1 

d  „  1  d  1  d 

(10) 

-u^fieHy  =  —^ - 

3xdx  Sx 

dx  ^  Sz  dz  Sz  dz  ^ 
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Next,  we  consider  the  eigenvalue  problem  associated  with  the  TM-modes  of  the  slab  dielectric 
waveguide  shown  in  Figure  1.  Nonmagnetic  materials  are  assumed.  The  waveguide  has  thickness 
2w,  constant  permittivity  ei,  and  its  axis  coincide  with  the  j?  axis.  Due  to  the  symmetry  of  the 
structure,  only  the  top  half  is  shown.  A  slab  of  thickness  I  and  permittivity  €2  <  ci  represents  the 
medium  outside  the  guide.  Notice  that  =  s  =  1  for  both  the  guide  and  the  adjacent  slab. 

Beyond  the  €2  slab  we  introduced  an  M-layer  structure  terminated  by  a  perfectly  conducting  plane. 
All  layers  in  the  structure  have  the  same  permittivity,  €2;  however,  their  thicknesses,  dm,  and  the 
values  of  are  allowed  to  vary,  while  =1.  Let  /3  be  the  unknown  propagation  constant 

for  even  TM-mode  propagation  in  A  straightforward  analysis  of  the  eigenvalue  problem  results 
in  the  following  eigenvalue  equation 


t  (h  t  1  +  exp(-2/;a:J)exp(-2fcx3  Sxjdi) 

I  -  exp{-2k2;J)ex-p(-2kj;2Yliii^xidi)^ 


(11) 


where  kl^  =  u^fioei  -  and  kl^  =  (S^  ~  wVo€2.  For  propagating  modes,  is  real  and  positive. 
At  this  point,  it  is  appropriate  to  recall  that  the  eigenvalue  equation  for  the  even  TM  modes  for  the 
case  of  the  standard  slab  waveguide  of  thickness  2w  and  ci  embedded  in  a  homogeneous  medium 
€2  is  given  by  tan(fciiy)  =  (fc2fi)/(^i^2)*  Thus  the  term  in  brackets  in  Eq.  (11)  may  be  thought  of 
as  the  “error”  term  caused  by  the  truncated  domain  and  the  presence  of  the  layered  structure.  It 
can  be  seen  that  as  /  ^  00  in  Eq.  (11),  the  eigenvalue  equation  for  the  standard  slab  waveguide 
is  recovered.  This  equation  also  suggests  that  instead  of  having  I  ->  00  the  eigenvalue  equation 
for  the  standard  waveguide  can  be  reclaimed  by  letting  assume  large  values.  How  large  these 
values  have  to  be  is  dependent  on  the  problem.  Further,  we  notice  that  needs  to  be  positive  to 
cause  the  error  terms  to  become  negligible  compared  to  1.  At  this  point,  it  is  appropriate  to  recall 
that  for  Berenger’s  PML  is  of  the  form  =  1  -  j(cT:/we,)  and,  as  such,  it  does  not  cause  any 
additional  attenuation  (since  kj;^  is  real)  to  help  reduce  the  error  term  in  Eq.  (11). 

As  an  example,  consider  a  guide  with  a  core  of  2ii;  =  1.0  cm  and  index  of  refraction  of  ni  =  1.3, 
surrounded  by  air.  Fig.  2  shows  the  results  of  Eq.  (11)  for  the  efective  index  of  refraction, 
«e//  =  for  free-space  wavelength  of  3.0  cm,  I  =  0.5  cm,  and  a  10  cell  layer  with  each 

layer  being  1.0  mm  thick  and  having  a  real  value  of  given  by  the  equation  Sxi  —  Sayg  (i/10)Hl, 
i  =  1,2,  - . 10.  The  dotted  line  denotes  the  value  of  ng//  for  the  case  of  the  standard  problem  where 
the  syrrounding  medium  is  homogeneous  and  infinite  in  extent.  This  value  is  ng//  =  1.07941.  This 
plot  suggests  that,  by  a  proper  choice  of  Savg,  it  is  possible  to  recover  a  solution  corresponding  to 
the  open  computational  domain  and  recover  a  solution  corresponding  to  the  open  domain  problem, 
using  a  small  computational  domain.  More  specifically,  Figure  2  suggests  that  a  value  of  Savg  =  10 
should  give  good  agreement  with  the  analytical  solution  of  the  infinite  domain  problem.  Notice 
that  the  computational  domain  in  the  x  direction  would  then  be  20  cells  in  extent  (5  in  the  guide, 
5  in  the  air  medium,  and  10  in  the  modified  PML  medium). 

For  the  sake  of  comparison,  Fig.  3  shows  the  results  of  Eq.  (11)  when  each  Sxi  =  1.0.  As 
mentioned  earlier,  the  presence  of  an  imaginary  part  in  of  the  form  ailu€i,  would  have  no 
impact  in  reducing  the  error  term  in  Eq.  (11).  The  independent  variable  is  the  number  of  cells  in 
addition  to  the  5  cells  used  to  model  the  air  medium  around  the  guide.  We  see  that  in  order  to  get 
an  accurate  solution  approximately  40  extra  cells  are  needed.  This  means  the  the  computational 
domain  in  the  x-direction  would  have  to  be  50  cells.  This  represents  over  a  doubling  of  the  domain 


in  the  x-direction. 
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3.  Nunerical  Implementation 

The  aforementioned  analysis  suggests  the  use  of  s  values  in  the  streched- coordinate  formulation 
of  the  PML  with  J2e{s}  >  1  in  order  to  facilitate  rapid  absorption  of  evanescent  waves  without 
affecting  their  propagation  characteristics.  Therefore,  in  order  to  encompass  the  general  case  of 
planar  structures  that  have  both  radiating  and  waveguiding  properties,  we  allow  s  to  assume 
complex  values  with  real  part  greater  than  1.  More  specifically,  Si  takes  the  form 


(12) 


where  i  =  x,y,z.  Notice  that  in  addition  to  the  real  part  of  Sj,  sj,  being  greater  than  1,  the  ratio 
(Xilepsilon  is  scaled  by  sj  also.  With  this  notation  for  Sj-,  the  modified  Maxwell’s  system  for  the 
TM  case  takes  the  form 


{‘dt 


Hy. 


{“Fy-f)"- 


(13) 

(14) 

(15) 

(16) 


For  a  numerical  application  of  the  aforementioned  modified  system,  we  consider  the  FDTD 
simulation  of  wave  propagation  in  our  model  slab  waveguide.  From  Fig.  2  we  see  that  using  a  value 
of  =  10  should  give  good  agreement  with  the  analytical  solution.  We  excited  the  structure  with 
the  first  even  TM  mode  and  used  the  FDTD  generated  time  histories  to  calculate  numerically  the 
effective  index  for  the  mode.  The  grid  size  was  1.0  mm,  and  the  modified  PML  was  placed  5  cells 
away  from  the  core.  The  calculated  numerical  value  for  was  1.07875  which,  compared  to  the 
analytic  value  of  Ue/f  =  1.07941,  exhibits  only  0.06%  difference.  Furthermore,  the  mode  profile 
obtained  from  the  FDTD  simulation  is  almost  indistinguishable  from  the  analytic  one  outside  the 
PML  region.  We  note  that  the  magnitude  of  the  normalized  field  at  the  PML  interface  is  around 
0.46,  showing  that  it  is  not  necessary  to  let  the  field  decay  significantly  before  introducing  the 
modified  PML. 


4.  Conclusions 

In  conclusion,  we  have  shown  that  the  originaUy  proposed  PML  does  not  facilitate  absorption 
of  surface  waves  in  the  direction  transverse  to  their  direction  of  propagation.  More  importantly,  it 
can  alter  their  propagation  characteristics  if  brought  close  to  the  material  interface  that  supports 
the  waves.  To  provide  for  evanescent  wave  absorption,  we  proposed  a  modified  version  of  the 
original  PML  medium.  The  modification  was  effected  using  the  streched-coordinate  formalism  of 
the  PML.  FDTD  simulations  of  wave  propagation  in  dielectric  slab  waveguides  were  used  to  verify 
numerically  the  proposed  modification. 
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On-going  work  examines  the  application  of  the  proposed  modified  PML  to  the  FDTD  modeling 
of  microwave  integrated  circuits  involving  various  types  of  planar  waveguides,  delay  fines,  filters, 
and  radiating  elements. 
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Figure  1:  Slab  waveguide  geometry  for  analytically  soluble  problem. 
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Abstract 

This  paper  reports  the  evaluation  of  the  Perfectly  Matched  Layer  (PML)  as  the  numerical 
absorbing  boundary  in  modeling  wave  propagation  in  microwave  passive  components  and  digital 
circuit  interconnects.  Some  conputationd  issues  on  the  finite-difference  implementation  of  PML  are 
discussed.  Theoretical  analysis  and  nunxrical  tests  show  that  the  PML  is  very  effective  for  absorbing 
TEM  and  quasi-TEM  waves,  as  well  as  non-TEM  waves  somewhat  above  cutoff  frequencies,  but 
ineffective  for  absorbing  evanescent  waves  and  non-TEM  waves  near  cutoff  frequencies. 


L  Introduction 

The  Perfectly  Matched  Layer,  recently  proposed  by  Berenger,  is  an  artificial  absorber  for  the 
truncation  of  nunKiical  conputation  domains  [1-2].  The  unique  feature  of  PML,  compared  to  other 
previously  proposed  lossy  absorbers,  is  that  PML  matches  interior  media  for  outgoing  waves  of  any 
incident  angles  and  frequencies.  The  Perfectly  Matched  Layer  absorbing  boundary  condition,  once 
it  was  proposed,  immediately  drew  large  attention  and  enthusiasm  in  the  computational 
electromagnetic  community.  Several  papers  have  appeared,  shortly  after  Berenger  s  invention,  that 
verified  and  extended  the  method  of  PML  and  applied  it  to  various  disciplines,  such  as  wave 
propagation,  scattering  and  waveguide  structures  [3-4]. 

We  tested  the  performance  of  the  PML  absorbing  boundary  condition  for  both  scattering 
probbms  and  wave  propagation  in  microwave  and  digital  circuit  transmission  components.  It  is  found 
that  PML  exhibits  excellent  performance  for  typical  scattering  problems.  For  modeling  of  wave 
propagation  along  waveguide  components,  PML  works  well  in  some  circumstances,  but  it  can  be 
totally  ineffective  in  other  circumstances,  mainly  in  absorbing  waves  near  cutoff  frequencies  ^d 
evanescent  waves.  This  paper  discusses  the  mechanism  of  the  performance  of  PML  for  propagating 
and  evanescent  waves,  and  presents  the  test  results  of  using  PML  in  modeling  wave  propagation  in 
metal  waveguide  and  microstrip  line  structures. 

n.  Electromagnetic  Waves  in  Perfectly  Matched  Layers 

Consider  the  media  shown  in  Figure  1,  where  the  free-space  is  in  the  region  x<0  and  the  PML 
is  in  x>0.  According  to  Berenger's  papers  [1-2],  parameters  of  the  PML  should  be  chosen  as 


a^/eo  -  cr/p, 
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where  and  Oy  are  electric  conductivities,  o*  and  Oy  are  magnetic  conductivities.  Suppose  a  plane 
wave  propagating  in  the  direction  which  has  an  angle  (p  with  the  x  axis.  As  long  as  the  condition  in 

(1)  is  satisfied,  the  Perfectly  Matched  Layer  would  not  cause  reflection  to  the  incident  wave  from  the 
free-space  for  any  incident  angle  <p  and  frequencies.  Let  \|;  be  any  component  of  the  fields  in  the 
PML,  then  can  be  expressed  as  [1]: 

a_coa<|> 

— i. - X 

If  there  is  no  PML,  the  term  e  will  not  be  there  in  (2). 

a_cos<p 
— = - X 

For  propagating  waves  in  the  x  direction,  coscp  is  a  real  number,  and  the  term  e  in 

(2)  contributes  to  the  attenuation  of  the  fields.  As  this  attenuated  wave  reaches  the  end  boundary  in 
the  x  direction  (typically  a  perfect  electric  wall),  it  is  reflected  and  further  attenuated  as  it  propagates 
back  towards  the  free-space  region.  For  cases  where  costp  is  not  close  to  zero,  with  a  proper 
selection  of  a’s  and  a  sufficient  width  of  the  PML,  waves  in  PML  can  be  attenuated  to  a  sufficiently 
small  value. 

On  the  other  hand,  if  the  variation  of  the  fields  in  the  x  direction  is  of  evanescent  nature,  cos<p 

CT.COS^ 

- * - X 

will  be  an  imaginary  number,  and  the  term  e  no  longer  decays  in  the  x  direction.  In  this  case, 

the  presence  of  the  PML  does  not  add  any  additional  attenuation  of  the  wave  beyond  the  original 
attenuation  of  the  evanescent  wave  itself. 

From  the  nature  of  the  PML  discussed  above,  one  can  predict  that,  for  the  modeling  of  wave 
propagation  in  waveguide  components,  PML  can  be  effective  for  waves  somewhat  above  the  cutoff 
frequencies,  but  ineffective  for  waves  near  cutoff  frequencies  (when  cos<p  is  close  to  zero)  and 
evanescent  waves  (when  cos<p  is  an  imaginary  number).  Numerical  tests  presented  in  later  sections 
of  this  paper  confirm  this  prediction. 

ni.  Time-Marching  Schemes  in  Perfectly  Matched  Layers 

The  exponential  time  marching  scheme  was  used  in  the  papers  of  Berengef  s  [1-2],  and  was 
also  used  and  advocated  by  some  other  investigators  [3-4].  Our  comparisons  made  from  numerical 
tests  show  no  noticeable  differences  between  the  exponential  time-marching  scheme  and  the  standard 
Yee's  scheme.  The  reason  for  this  can  be  explained  as  follows.  Inside  PML,  fields  do  decay  rapidly 
in  space,  but  not  in  time.  Therefore,  it  is  not  essential  to  use  the  exponential  time-marching  scheme 
to  achieve  the  high  absorbing  property  in  applying  PML. 

Figure  2  shows  the  numerical  reflection  coefficients,  with  normal  incident  angle,  of  a  8-cell 
PML  irtplemented  with  the  exponential  time-marching  scheme  and  the  standard  Yee's  scheme.  The 
numerical  reflection  coefficient  is  obtained  by  the  ratio  of  the  Fourier  transformed  reflected  field  and 
the  incident  field.  In  this  test,  the  analytic  reflection  coefficient  of  the  PML  are  chosen  as  10  and 
10® respectively,  and  the  variation  of  the  conductivity  with  space  is  parabolic.  As  can  be  seen  from 
Hgure  2,  there  is  virtually  no  difference  on  the  absorbing  property  of  PML  between  the  exponential 
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time-marching  scheme  and  the  standard  Yee’s  scheme.  Same  observation  is  obtained  when  the 
incident  wave  is  not  of  normal  incidence  angle. 

rv.  PML  Applied  to  Waveguide  Components 

First  consider  a  parallel-plate  waveguide  shown  in  Figire  3.  The  separation  between  the  two 
metal  plates  is  a  =  40  mm.  The  lowest  order  mode  of  this  structure  is  the  TMq  mode  which  is 
actually  a  TEM  mode  with  a  cutoff  frequency  at  dc.  The  propagation  of  the  TMq  mode  fields  along 
the  parallel-plate  waveguide  is  essentially  a  one-dimensional  wave  propagation  problem.  The  PML 
boundary  condition  can  absorb  outgoing  waves  very  well  in  an  ultra-wide  frequency  spectrum,  as 
reported  in  [4]. 

The  lowest  order  non-TEM  mode  in  the  waveguide  shown  in  Figure  3  is  the  TM,  or  TEi 
mode  which  has  a  cutoff  frequency  of  3.75  GHz.  First  let  us  examine  how  the  TM,  mode  wave  is 
attenuated  in  the  PML.  Choose  the  conductivity  of  the  PML  to  be  constant  with  space.  From  (1), 
it  can  be  easily  found  that  the  attenuation  constant  of  the  TM^  mode  in  the  waveguide  is: 


a. 


(-) 

a 


k  \ 


(f  <  f^) 
{f  >  fj 


(3) 


When  f  <  fc,  the  attenuation  in  PML  is  the  same  as  that  in  air-filled  waveguide.  When  f  >fc,  the 
attenuation  is  due  to  the  contribution  of  PML.  Expression  (3)  can  be  numerically  verified  by 
launching  a  TMj  mode  in  the  waveguide  filled  with  PML  medium,  and  extracting  the  attenuation 
constant  of  the  wave  propagating  along  the  waveguide.  Let  us  choose  the  constant  conductivity  of 
the  PML  to  be  =  0.1  S/m.  Figure  4  shows  that  the  attenuation  constant  expressed  in  (3)  agrees 
very  well  with  that  extracted  from  numerical  solutions  of  the  wave  in  the  waveguide  filled  with  PML. 

Figure  5  displays  the  numerical  reflection  coefficient  of  the  PML  versus  frequency  for  an 
incident  wave  of  TM,  mode.  The  conductivity  profiles  in  PML  are  all  chosen  to  be  parabolic.  It  can 
be  seen  from  Figure  5  that,  unlike  the  case  for  the  TMq  mode,  PML  performs  weU  only  when  the 
frequency  is  somewhat  above  the  cutoff  frequency.  The  total  reflection  is  reached  near  the  cutoff 
frequency.  When  the  frequency  is  less  than  the  cutoff  frequency,  the  attenuation  of  the  wave  in  PML 
makes  the  reflection  coefficient  less  than  1.  Notice  that  the  PML  does  not  contribute  additional 
attenuation  to  the  evanescent  waves. 

Next  let’s  consider  a  regular  rectangular  waveguide.  The  width  of  the  longer  side  of  the 
rectangular  waveguide  is  chosen  as  40  mm.  The  numerical  reflection  coefficients  for  TEjq,  TE20  and 
TE30  modes  are  shown  in  Figure  6.  Similar  conclusions  can  be  drawn  from  results  of  Figure  6  as 
those  from  Figure  5.  In  order  to  achieve  low  reflection  in  the  spectrum  below  cutoff  frequencies, 
PML  needs  to  be  placed  sufficiently  away  from  sources  or  discontinuities  to  let  evanescent  waves 
fully  die  out 
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V.  PML  Applied  to  Microstrip  Lines 

To  simulate  a  wave  propagation  along  a  microstrip  line,  PML  is  placed  near  the  outer 
computation  domain  as  shown  in  Figure  7.  At  the  end  surfaces  perpendicular  to  the  metal  strip, 
where  outgoing  waves  incident  upon  outer  boundaries  in  near  normal  incident  angles,  PML  is  found 
to  be  very  effective.  On  the  top  and  the  side  surfaces  of  the  computation  domain,  where  fields  are 
mostly  evanescent  in  the  direction  normal  to  outer  surfaces,  PML  is  found  to  be  less  effective. 

Figure  8  shows  the  percentage  errors  in  the  computed  voltage  of  the  microstrip  line  for 
different  widths  of  the  PML  placed  at  different  distances  away  from  the  center  of  the  microstrip.  The 
reference  data  for  the  voltage  is  obtained  with  a  very  large  computation  domain.  Figure  9  is  the 
corresponding  confuted  effective  dielectric  constant  of  the  microstrip  line.  As  can  be  seen  from 
Figure  9,  PML  needs  to  be  placed  at  certain  distance  away  from  the  metal  strip  and  of  enough 
thickness  to  ensure  the  reliability  of  the  computed  electrical  properties  of  the  microstrip  line. 

VI.  Conclusion 

This  paper  presents  the  performance  of  PML  for  modeling  wave  propagations  in  waveguides 
and  microstrip  lines.  It  is  demonstrated  through  theoretical  analysis  and  numerical  tests  that  PML 
is  very  effective  for  absorbing  outgoing  propagating  waves  somewhat  above  cutoff  frequencies,  but 
ineffective  in  regions  near  and  below  cutoff  frequencies.  To  terminate  computation  boundaries  where 
evanescent  waves  are  significant,  PML  needs  to  be  placed  sufficiently  away  from  modeled  objects  to 
let  evanescent  waves  die  out. 
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Figure  1.  The  region  of  free-space  on  the  left  and  the  PML  on  the  right.  The  incident  wave 
propagates  in  the  direction  which  has  an  angle  tp  with  the  x  axis. 


Frequency  (GHz) 

Figure  2.  Comparison  of  reflection  coefficients  of  8-cell  PML  implemented  with  the 
exponential  time-marching  scheme  and  Yee's  scheme.  The  analytic  reflection  coefficients 

of  PML  are  chosen  as  10’^  and  10 


y 

Metal  I  ^ 


Metal  X 

Figure  3.  Configuration  of  a  parallel-plate  waveguide. 
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Numerical 


Figure  4.  Attenuation  constant  in  PML,  The  dash  line  is  the  analytic  solution  expressed  in  eqn  (3). 
The  solid  line  is  obtained  from  the  numerical  solution  of  a  wave  propagating  in  PML. 


Figure  5.  Reflection  coefficients  of  TMl  mode.  The  width  of  the  PML  is  16, 8  and  4  cells 

respectively.  The  analytic  reflection  coefficients  of  the  PML  are  all  chosen  as  10  . 
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Figure  6.  Reflection  coefficients  of  TEIO,  TE20  and  TE30  modes.  The  width  of  the  PML  is  of 
16  cells,  and  the  analytic  reflection  coefficients  of  the  PML  are  all  chosen  as  10  . 
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Figure  7.  Cross  section  of  the  computation  domain  for  a  microstrip  line.  Due  to  the  symmetry 
of  the  structure,  wave  propagation  in  half  of  the  physical  structure  is  simulated. 
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Figure  8.  Percentage  errors  in  the  computed  voltage  of  the  microstrip  line.  The  computation 
domain  of  the  cross  section  is  of  20  by  20  or  14  by  14  cells.  The  width  of  the  PML  is  8  or  4  cells. 
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Figure  9.  Numerically  computed  effective  dielectric  constants  of  the  microstrip  line.  The  computation 
domain  of  the  cross  section  is  of  20  by  20  or  14  by  14  cells.  The  width  of  the  PML  is  8  or  4  cells. 
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1  Abstract 

A  3-D  finite  volume  algorithm  for  the  Maxwell’s  equations  has  been  developed  on  generalized 
body  conformal  curvilinear  grids.  The  algorithm  is  explicit  in  nature  and  is  especially  suited  to  a 
massively  parallel  environment.  It  utilizes  the  four  stage  Runge-Kutta  time  integration  method. 
Integration  of  the  Maxwell’s  equations  is  carried  out  on  a  dual  grid  wherein  the  electric  and 
magnetic  field  quantities  are  evaluated  on  different  grids.  The  algorithm  has  been  successfully 
tested  on  the  CM-200/CM-5  in  solving  scattering  problems  of  electromagnetic  waves  from  various 
targets.  Results  are  presented  for  the  problem  of  scattering  from  a  perfectly  conducting  sphere 
and  a  perfectly  conducting  ogive.  These  results  are  in  extremely  good  agreement  with  the  exact 
solution  and  with  results  obtained  with  a  standard  finite  difference  time  domain  code.  The 
formulation  used  in  this  case  is  for  the  scattered  field  and  the  Liao  boundary  condition  is  used 
at  the  outer  non-reflecting  boundary.  The  far  zone  transformation  has  also  been  implemented 
efficiently  in  a  parallel  manner  to  evaluate  the  far  zone  scattering  results. 

2  Introduction 

The  Finite  Difference  Time  Domain  (FD-TD)  effort  in  computational  electromagnetics  has  been 
primarily  geared  towards  efficient  development  of  the  Yee  algorithm  [1]  on  a  staggered  uniform 
grid.  Although  this  algorithm  is  robust,  efficient  and  lacks  the  propagation  of  spurious  waves  due 
to  non-uniform  grid  size,  it  is  prone  to  staircasing  errors.  In  order  to  eliminate  these  staircasing 
errors  a  host  of  finite  difference  and  finite  volume  algorithms  [2]- [6]  have  been  developed  for  body- 
conformal  orthogonal  grids.  However,  most  of  these  algorithms  are  limited  by  low  CFL  numbers 
or  cannot  be  exploited  in  a  data  parallel  manner.  In  the  present  paper  we  make  use  of  the  finite 
volume  approach  with  the  Runge-Kutta  time  integration  method  that  has  been  successfully  used 
in  aerodynamic  [7]  and  aero- acoustic  type  problems  [8]- [9].  The  Finite  Volume  Time  Domain  (FV¬ 
TD)  algorithm  is  based  on  the  four-stage  Runge-Kutta  explicit  time  stepping  method  on  a  dual 
grid.  All  three  electric  field  components  are  computed  at  the  same  grid  points  and  three  magnetic 
field  components  are  solved  for  on  the  same  dual  grid  points.  The  reason  for  adopting  a  dual  grid 
based  approach  is  to  avoid  adding  artificial  dissipation  explicitly  (typical  of  central  differencing 
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schemes)  or  make  the  code  dependent  on  algorithmic  dissipation  (upwind/ characteristic  schemes). 
Explicitly  adding  artificial  dissipation  requires  tuning  certain  parameters  to  prevent  odd-even 
decoupling  or  killing  the  unwanted  high  frequency  waves.  Algorithmic  dissipative  schemes,  more 
often  than  not  tend  to  be  too  dissipative,  thereby  requiring  a  very  high  resolution  for  the  waves 
to  appear  as  low  frequency  waves  to  the  grid. 


3  Numerical  Model 


Maxwell’s  equations  can  be  written  in  the  conservative  vector  form  as 


dt 

m 

dt 


■  =  -V  X  E 
=  VxH-j 


(1) 

(2) 


Assuming  the  scatterer  to  be  a  perfect  conductor  and  rewriting  the  equations  in  their  integral 
form  for  free  space  we  have 


Since  the  system  of  Maxwell’s  equations  are  linear,  the  equations  are  solved  only  for  the  scat¬ 
tered  fields  since  the  analytical  solution  for  the  incident  field  is  known.  Henceforth  the  fields  writ¬ 
ten  without  any  superscripts  will  be  understood  to  be  the  scattered  fields  [E  =  H  ). 

If  the  total  or  incident  fields  have  to  be  alluded  to  they  will  be  explicitly  denoted  by  their  respective 
superscripts. 


A  finite  volume  formulation  is  used  to  solve  the  above  system  of  equations.  A  dual  grid  is 
used  in  this  formulation.  Each  E-field  point  is  located  at  the  centroid  of  the  cube  whose  corners 
are  H-field  points  and  similarly  each  H-field  point  is  located  at  the  centroid  of  an  E-field  cube. 
Figure  1  depicts  the  combination  of  a  E-iield  cell  and  an  H-field  cell  that  form  the  dual  grid. 
This  formulation  differs  from  the  Yee  cell  formulation  in  that  although  the  E-field  points  and  the 
H-field  points  are  not  collocated  all  the  E-field  components  are  solved  at  the  same  point  and  all 
the  H-field  components  are  solved  at  the  same  dual  grid  point.  The  dual  grid  is  generated  by  first 
using  a  standard  grid  generator  to  generate  the  grid  for  the  E-held  points  for  a  scatterer  that  is  a 
perfect  conductor  (and  this  grid  would  correspond  to  a  H-held  grid  if  the  scatterer  was  composed 
of  a  magnetic  material).  The  H-field  grid  is  then  generated  from  the  E-field  grid  by  taking  the 
average  of  the  coordinates  of  the  corners  of  each  E-field  cell. 

The  system  is  solved  using  a  four-stage  Runge-Kutta  time  integration  method  given  by 

Q-  =  Q7-7^AtRr'^  m  =  1,2,3,4 
Q-+1  =  Qf  H- 
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The  time  step  is  denoted  by  n  and  each  stage  of  the  Runge-Kutta  method  by  m,  where  the 
coefficients  are  7^  =  1  respectively. 


The  residuals  Rj  are  defined  as 


Ex 

By 

Q2  = 

Ey 

.  Ex  . 

r(AF  + AG  + AK) 


R2  =  -r77(^L  +  AM  +  AN) 
A  Vd 


,;  +  l/2,A:+l/2  ^  ®"i+l,j+l/2,fc+l/2  ' 


■  Sli+l/2,A:+l/2  ^  Elj+l/2,fc+l/2) 


AG  =  -  (S? 


'i+l/2,j+l,fc+l/2 


'z+l/2,j+l,fc+l/2  ' 


AK  =  -  (Sj 

tl.  ^ 


'+l/2,i+l/2,fc+l  ^  ^i+l/2,j+l/2,k+l  ' 


CV  w  fpJ?  \ 

■  ^1+1/2, i,i+l/2  ^  ^i+l/2,jMl/‘2j 


■  ^i+l/2,j-i-l/2,k  ^  ®i+l/2,i+l/2,fc) 


In  the  above  expressions  S^,S^,S^  represent  the  projected  surface  areas  of  constant  4,  ?/,C 
faces  respectively.  They  are  calculated  by  taking  half  the  vector  cross  product  of  the  two  diagonal 
vectors  that  join  the  four  vertices  of  a  cell  face.  For  example  the  area  of  the  surface  abed  as  seen 
in  Figure  2  is  calculated  as 

2  [  {yc-  Va)  {Zd  -  -  {Zc  -  Za)  [Vd  “  Vh) 

=  —  (Zi;  Za)  (^d  ^b)  (^c  ^a)  (z^ 

^  [  (xc  -  Xa)  (yd  -  yb)  ~  (yc  -  ya)  (^d  ~  ^b)  . 

It  should  be  noted  that  the  electric  field  points  used  in  the  evaluation  of  the  above  fluxes  lie  in 
the  center  of  the  cell  faces  and  are  not  computed  directly  from  the  integration  process.  Instead 

they  are  extrapolated  from  the  electric  field  points  that  make  up  the  corners  of  the  cell  faces.  For 

example  from  Figure  3 

^lj+l/2,k+j/2  —  ^  +  ^ij+l,k  +  ^ij,k+l  + 

Kordulla  and  Vinokur  [10]  have  proposed  an  efficient  method  of  calculating  cell  volumes  for 
three-dimensional  flow  predictions.  The  volume  AV  is  calculated  by  partitioning  the  hexahedron 
cell  into  five  tetrahedra  and  computing  the  volume  of  each  tetrahedron  separately.  This  decom¬ 
position  helps  avoid  gaps  and  overlaps  in  computing  cell  volumes  and  is  computationally  very 
efficient. 

The  residual  R2  used  to  evaluate  the  E-field  components  can  be  computed  in  a  similar  manner: 


AL  =  —  Sd: 


^  V  V 

i+l,i+l/2,fc+l/2  ^  ■^:+l,j+l/2,fc+l/2  ^^iJ+l/2,k+l/2  ^  •*^i,i+l/2,fc+l/2, 
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am  =  — -  (Sd-'^l/2,i+l,fe+l/2  ^  ^?+l/2,j+l,fc+l/2  “  ^^:+l/2,i,fc+l/2  ^  ^i+l/2,j,fc+l/2) 

AN  =  — -  (Sd^^l/2j+l/2,fc+l  ^  ^f+l/2,i+l/2,fc+l  “  ®^i+l/2,j  +  l/2,fc  ^  ^i+l/2,i+l/2,fe) 

The  expressions  Sd^  Sd"^,  Sd**  represent  the  projected  surface  areas  of  constant  (  faces  of 
the  dual  grid  respectively.  It  must  be  pointed  out  the  subscripts  i,j,k  used  in  the  calculation  of 
the  above  fluxes  are  subscripts  on  the  dual  grid. 

4  Boundary  Conditions 

We  have  to  consider  the  boundary  conditions  at  the  surface  of  the  scatterer  and  the  radiation 
condition  at  the  outer  boundary.  In  both  cases  we  have  only  the  E-field  points  located  at  the 
boundaries.  The  H-field  points  lie  half  a  cell  away  from  the  boundaries  and  are  considered  as 
interior  points.  Hence,  we  have  to  determine  the  boundary  conditions  for  only  the  E-field  compo¬ 
nents. 


4.1  Surface  Boundary  Condition 

At  the  surface  of  a  perfectly  conducting  scatterer  we  know  that  the  tangential  components  of  the 
total  electric  field  is  equal  to  zero. 

n^  X  -  0 

where  nn  is  the  unit  normal  to  the  surface  of  the  conductor. 


This  condition  can  be  approximated  as 


_  £Ti  j^total  _  q 


S,  Filial  _ 


It  can  be  easily  seen  that  the  above  three  equations  are  not  enough  to  compute  the  three 
components  of  the  electric  field  at  the  surface  of  the  scatterer.  We  need  an  additional  condition 
in  order  to  make  the  system  determinate.  In  this  case  we  take  into  consideration  Gauss’s  electric 
law  which  can  be  written  in  the  integral  form  as 

///,(V.D)JV  =  0  (8) 

or  .  . 

J  j^{n-T))dS  =0  (9) 

We  need  to  apply  Gauss’s  electric  law  to  each  cell  that  lies  on  the  boundary.  Assuming  that  the 
sides  perpendicular  to  the  conducting  surface  have  zero  flux  through  them  (Figure  4)  we  obtain 

g,  .  gto^al  ^  gr,  . 

Combining  this  condition  with  the  fact  that  the  tangential  components  of  the  total  fields  are 
equal  to  zero  we  can  write  explicit  expressions  for  the  scattered  field  components  of  the  electric 
fields  at  the  boundaries. 
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In  the  above  expressions  the  quantities  [5J],  [5J],  [5j]  are  the  average  of  the  respective  surface 
areas  of  the  four  cells  surrounding  the  E-field  point  on  the  boundary. 


4.2  Far  Field  Boundary  Condition 

Most  body  conformal  grid  algorithms  in  electromagnetics  [4] -[6]  use  a  simple  outer  boundary 
condition  that  either  solves  the  one-dimensional  wave  equation  directly  or  is  based  on  the  one¬ 
dimensional  method  of  characteristics  in  cartesian  coordinates.  These  boundary  conditions  are 
analogous  to  the  Mur-type  boundary  condition  that  is  frequently  used  in  computational  electro¬ 
magnetics.  They  generally  work  well  when  the  outgoing  wave  is  a  plane  wave  travelling  perpen¬ 
dicular  to  the  boundary. 

In  order  to  improve  the  outer  boundary  condition  the  Liao  [14]  form  of  the  non- reflecting 
boundary  condition  has  been  implemented.  This  can  be  written  in  its  discrete  form  as 


where 


E^ijmax  = 


(13) 


and 


Tn  = 


{2-s){l-s) 


Tn  —  •s(2  —  5) 


Ti3  = 


s{s  -  1) 


cAt 

— 

An 


where  0  <  o;  <  2 


5  Far  Zone  Transform 

One  of  the  most  important  applications  of  solving  the  Maxwell’s  equations  in  the  time  domain 
is  the  analysis  of  the  scattered  fields  from  radar  targets.  For  this  purpose,  it  is  important  to 
compute  the  far  zone  scattered  fields.  In  this  section  we  have  used  the  method  developed  in 
[13]  and  modified  it  for  the  FV-TD  algorithm.  It  should  be  mentioned  that  although  the  method 
developed  in  [13]  is  quite  generic,  the  discussion  of  its  application  was  limited  to  the  Yee  algorithm. 
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In  [13]  the  authors  show  that  the  retarded  potentials  w  and  u  can  be  computed  from  the  scattered 
surface  currents  using  the  following  formulas: 

W  [t  -  (r  •  f)  /c]  =  (^)  I  j.  it)  (14) 

(15) 

Assuming  that  the  surface  i?  =  constant  denotes  the  integration  surface  for  the  far  zone 
transform  the  equations  can  be  recast  in  discretized  form  as  follows: 

u  [f  _  (f .  f)  /c]  =  (E  X  8”)"+'  -  (Ex  S”)"  (16) 

At  each  time  step  the  contribution  of  each  cell  surface  that  makes  up  the  integration  surface 
■n  =  constant  is  calculated  and  placed  in  its  corresponding  w  bin  whose  index  is  computed  by 
subtracting  the  time  delay  factor  from  the  elapsed  time  and  dividing  the  result  by  the  time  step 
At. 


The  retarded  potential  w  has  to  be  computed  in  a  slightly  different  manner  since  the  magnetic 
field  points  do  not  lie  on  the  surface  rj  =  constant.  Instead,  there  exists  a  magnetic  field  point 
half  a  cell  above  and  half  a  cell  below  the  surface.  Therefore  we  can  take  the  average  of  the  time 
derir’atives  of  the  cell  above  and  below  and  compute  the  retarded  potential  w. 


w  [i  —  (f  •  f)  /c]  =  X 


(Ijn+l  _  +  (H"+'  - 


(17) 


The  far  zone  scattered  fields  can  be  computed  from  the  above  potentials  as 


Es  =  —Zwg  —  U:f,  =  —Zw^  +  ue 


6  Parallelization  Issues  and  CM  Performance 

A  CM-Fortran  (  essentially  High  Performance  Fortran  )  version  of  the  FV-TD  code  has  been 
developed.  Some  of  the  issues  that  came  up  during  the  development  process  are  discussed. 

Since  High  Performance  Fortran  supports  array  based  data  structures  most  of  the  ‘’Do  Loop  ’ 
processes  in  the  Fortran  77  version  of  the  FV-TD  code  were  transformed  into  simple  array  oper- 

""^'^ELnsive  use  is  made  of  the  CMF  LAYOUT  command.  This  command  instructs  the  compiler 
as  to  how  the  arrays  should  be  laid  out  in  the  computers  memory.  For  example  the  compiler 
directive 


CMF$  LAYOUT  A(;  news,:  news,:  news) 

would  force  A  to  be  spread  across  all  the  parallel  processors.  All  the  electromagnetic  variables 
are  assigned  to  parallel  arrays  using  the  mews  directive.  In  other  words,  all  the  six  components 
of  the  electric  and  magnetic  fields  are  stored  as  parallel  variables  spread  across  all  the  processors. 
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The  more  difficult  parts  of  code  conversion  lie  in  the  implementation  of  the  outer  radiation 
boundary  condition.  The  Liao  boundary  conditions  use  3- dimensional  arrays  but  one  of  the 
dimensions  is  less  than  the  other  two.  Additional  arrays  are  defined  that  store  the  electromagnetic 
variables  at  previous  time  steps  and  a  masking  function  is  used  to  perform  the  boundary  condition 
operations  only  at  the  boundaries.  This  method  is  not  very  efficient  since  most  of  the  processors 
are  sitting  idle  while  the  boundary  operations  are  being  performed. 

However,  the  most  significant  issue  was  that  of  the  implementation  of  the  far  zone  transforma¬ 
tion.  The  far  zone  transformation  is  essentially  a  serial  process  since  the  time  delay  factor  dictates 
the  index  of  the  potential  arrays  where  the  contribution  of  the  surface  arrays  have  to  be  accounted 
for.  This  is  being  achieved  by  a  process  called  send  with  add  and  the  use  of  indirect  array  ref¬ 
erencing,  where  the  time  delays  of  different  points  on  the  far  zone  surface  are  stored  in  an  array 
that  acts  like  a  set  of  pointers  for  the  potential  arrays.  Since  the  electric  and  magnetic  currents 
on  the  far  zone  transformation  surface  are  cross  products  of  surface  areas  and  the  time  derivatives 
of  the  electromagnetic  field  components  they  are  computed  as  two-dimensional  quantities. 

Once  the  components  of  the  magnetic  and  electric  field  currents  are  computed,  their  contri¬ 
bution  is  added  to  the  potential  arrays  which  are  essentially  one- dimensional  quantities.  Hence 
the  current  arrays  are  converted  to  one- dimensional  quantities  using  the  RESHAPE  function.  For 
example  : 

dcrntmx  —  reshape  {mold  =  [imaxl  *  kmaxl] ,  source  =  curntmx) 

Issuing  the  above  command  converts  the  two-dimensional  array  CURNTMX  into  a  one- 
dimensional  array  DCURNTMX  with  a  length  of  imaxl  x  kmaxl. 

For  the  currents  corresponding  to  each  grid  point  on  the  far  zone  transformation  surface  there 
is  a  time  delay  factor  that  is  computed  that  determines  the  appropriate  potential  array  into  which 
the  contribution  of  the  current  is  added  to.  In  other  words  the  time  delay  factor  array  acts  as  a 
pointer  array  to  the  currents  array. 

FORALL  {I  =l:N)  WPOT  {!)  =  WPOT  (!)  + 

SUM  {DCURNTX  (1  :  Ifzc) ,  MASK  =  MM3  ( 1  :  Ifzc)  .eq.I) 

Here  WPOT  is  a  potential  array,  and  MM3  is  a  pointer  array.  The  contribution  of  dcurntx  is 
added  to  the  appropriate  potential  array  block  by  the  masking  condition  provided  by  the  pointer 
array  MM3.  This,  essentially  is  the  logic  behind  the  SEND-WITH-ADD  process. 

A  run  on  the  32  node  CM  —  5  took  19.92  micro-sec/grid  point /time-step.  This  time  included 
the  overhead  of  reading/writing  files.  With  the  far  zone  process  excluded,  an  equivalent  run  on 
the  32  node  CM  -  5  takes  17.38  micro-sec/grid  point/time-step. 

7  Results 

The  first  test  case  considered  here  is  that  of  scattering  of  a  wideband  pulse  from  a  perfectly 
conducting  sphere.  The  grid  used  for  this  case  was  an  0-0  type  grid  and  consisted  of  37x46x73 
grid  points.  The  sphere  is  of  radius  0.1  meters.  The  incident  pulse  is  represented  by  the  derivative 
of  the  Gaussian  and  can  be  explicitly  written  as 

/  (0  =  -2a  (r  -  /3Af)  (18) 
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where  t  is  the  time  delay  parameter 


T  =  t-{r-f)lc 


and  (9  is  a  user  defined  parameter  that  dictates  the  width  of  the  pulse.  The  value  of  ^  used 
for  the  FVTD  run  in  this  case  was  756.  It  should  be  mentioned  at  this  point  that  the  FVTD 
results  are  being  compared  to  the  results  obtained  with  the  PSU-FDTD  code  that  employs  the 
Yee  algorithm.  An  equivalent  ,9  of  64  was  used  in  the  FDTD  run.  (The  FDTD  run  was  made 
with  a  cell  size  of  0.01m  or  ten  cells  per  radii)  The  difference  in  the  /3  value  can  be  attributed  to 
the  difference  in  the  time  step,  because  the  value  of  /3At  has  to  be  the  same  for  both  codes.  Since 
the  value  of  ^  defines  the  spectral  band  width  it  has  to  be  chosen  judiciously  so  as  not  to  create 
any  unnecessary  noise.  Figure  5  is  a  comparison  of  the  backscatter  cross-sertion  against  frequency 
of  "the  FVTD  and  FDTD  algorithms  and  the  exact  solution.  The  comparison  between  the  exact 
solution  and  the  FVTD  solution  is  exceptionally  good  especially  at  low  frequencies  and  is  seen  to 
be  an  improvement  over  the  FDTD  solution.  Figure  6  is  a  comparison  of  far  zone  backscatter  Ee 
component  between  the  serial  version  of  the  FVTD  algorithm  and  a  parallel  implementation  on 
the  CM-5.  The  parallel  far  zone  implementation  seems  to  match  the  serial  algorithm  exactly. 

The  second  test  case  investigated  was  that  of  the  metallic  ogive.  The  ogive  has  a  half  angle 
of  22.62  degrees  and  an  aspect  ratio  of  5:1.  The  length  of  the  ogive  is  10  inches.  An  0-H  type 
grid  was  used  in  this  case  (Figure  7).  The  grid  consisted  of  200  x  31  x  32  cells  and  was  clustered 
near  the  leading  and  trailing  edges  and  close  to  the  surface  of  the  scatterer.  A  wideband  Gaussian 
pulse  is  incident  normal  to  the  leading  edge.  The  direction  of  propagation  of  the  incident  pulse 
is  parallel  to  the  major  axis  of  the  ogive.  In  Figure  8  the  co-polarized  RCS  is  plotted  versus 
frequency  for  the  FVTD  and  FDTD  algorithms.  The  FDTD  results  were  obtained  in  this  case 
with  4  million  grid  points. 

A  third  test  case  was  run  with  a  metallic  ellipsoid  of  the  same  aspect  ratio  as  the  ogive.  Th^ 
case  is  important  because  the  ellipsoid  is  representative  of  the  fuselage  of  an  airplane.  An  0-0 
type  grid  was  used  with  50  x  31  x  72  grid  cells.  An  ultra-wideband  Gaussian  pulse  is  incident 
normal  to  the  leading  edge.  Figure  9  depicts  the  evolution  of  the  scattering  process. 


8  Conclusions  and  Future  Work 

A  finite  volume  code  has  been  developed  based  on  a  dual  grid  approach  for  the  CM-200/CM-5 
machines  Processes  like  the  far  zone,  that  are  inherently  serial  in  nature,  have  been  eflicienUy 
parallelized.  The  algorithm  and  the  code  can  be  used  to  solve  scattering  problems  for  bodies 
of  arbitrarv  shapes.  The  results  obtained  are  in  good  agreement  with  the  exact  solution  an 
conventional  FD-TD  methods.  In  some  cases,  like  the  metallic  ogive  reasonable  results  were 
obtained  with  20  times  less  grid  points. 
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Further  testing  of  the  algorithm  and  the  computer  code  is  underway  for  various  other  EMCC 
(Electromagnetic  Code  Consortium)  benchmark  configurations  like  the  NASA  almond.  The  code 
is  also  being  tested  for  the  RCS  (Radar  Cross  Section)  of  the  GE  Cl  A  engine  inlet. 

9  Acknowledgements 

The  work  done  here  was  supported  by  NASA  under  grant  NAG2-867.  The  authors  would  also 
like  to  thank  NAS  (National  Aerodynamic  Simulation)  and  the  Center  for  Academic  Computing 
at  Penn  State  for  the  computing  facilities  made  available. 

References 

[1]  Yee,  K.S.,  “Numerical  Solution  of  Initial  Boundary  Value  Problems  Involving  Maxwell’s 
Equations  in  Isotropic  Media”,  IEEE  Transactions  on  Antennas  and  Propagation  Vol.  AP- 
14,  No.  3,  May  1966. 

[2]  Goorjian,  P.M.,  “Algorithm  Development  for  Maxwell’s  Equations  for  Computational  Elec¬ 
tromagnetism”,  AIAA  Paper  90-0251,  Reno,  Nevada,  Jan  1990. 

[3]  Madsen,  N.K.,  and  Ziolkowski,  R.W.,  “A  Three-Dimensional  Modified  Finite  Volume  Tech¬ 
nique  for  Maxwell’s  Equations”,  Electromagnetics  Vol  10,  pp  147-161,  1990. 

[4]  Shankar,  V.,  “A  Gigaflop  Performance  Algorithm  for  Solving  Maxwell’s  Equations  of  Elec¬ 
tromagnetics”,  AIAA  Paper  91-1578,  Honolulu,  June  1991. 

[5]  Noack,  R.W.  and  Anderson,  D.A.,  “Time  Domain  Solutions  of  Maxwell’s  Equations  Using  a 
Finite  Volume  Formulation”,  AIAA  Paper  92-0451,  Reno,  Nevada,  Jan  1992. 

[6]  Shang,  J.S.  and  Gaitonde  D.,  “Scattered  Electromagnetic  Field  of  a  Reentry  Vehicle”,  AIAA 
Paper  94-0231,  Reno,  Nevada,  Jan  1994. 

[7]  Weinberg,  Z.  and  Long  L.N.,  “A  Massively  Parallel  Solution  of  the  Three  Dimensional  Navier- 
Stokes  Equations  on  Unstructured  Adaptive  Grids”,  AIAA  Paper  94-0760,  Reno,  Nevada,  Jan 
1994. 

[8]  Ozyoruk,  Y.,  and  Long  L.N.  “A  Navier- Stokes /Kirchhoff  Method  For  Noise  Radiation  From 
Ducted  Fans”,  AIAA  Paper  94-0462,  Reno,  Nevada,  Jan  1994. 

[9]  Chyczewski,  T.S.,  and  Long,  L.N.,  “A  Higher  Order  Accurate  Parallel  Algorithm  for  Aeroa- 
coustic  Applications”,  AIAA  Paper  94-2265,  Colorado  Springs,  Colorado,  June  1994. 

[10]  Kordulla,  W.,  and  Vinokur,  M.,  “Efficient  Computation  of  Volume  in  Flow  Predictions”, 
AIAA  Journal,  Vol.  21,  No.  6,  pp  917-918,  June  1983. 

[11]  Agarwal,  R.K.,  and  Deese,  J.E.,  “Transonic  Wing-Body  Calculations  Using  Euler  Equations”, 
AIAA  Paper  83-0501,  Reno,  Nevada,  Jan  1983. 


521 


[12]  Jameson,  A.,  Schmidt,  W.,  and  Turkel,  E.,  “Numerical  Solutions  of  the  Euler  Equations  by 
Finite  Volume  Methods  Using  Runge-Kutta  Time-Stepping  Schemes”,  AIAA  Paper  81-1259, 
Palo  Alto,  California,  June  1981. 

[13]  Luebbers,  R.J.,  Kunz,  K.S.,  Schneider,  M.  and  Hunsberger,  F.,  “A  Finite-Difference  Time- 
Domain  Near  Zone  to  Far  Zone  Transformation”,  IEEE  Transactions  on  Antennas  and  Prop¬ 
agation,  Vol  39,  No.  4,  April  1991. 

[14]  Liao,  Z.P.,  Wong,  H.L.,  Yang,  G.P.,  and  Yuan  Y.F.  “A  Transmitting  boundary  for  Transient 
Wave  Analysis”,  Scietia  Sinica,  Vol.  28,  No.  10,  pp  1063-1076,  Oct  1984. 


522 


THE  PIECEWISE  LINEAR  RECURSIVE  CONVOLUTION  METHOD 
FOR  INCORPORATING  DISPERSIVE  MEDIA  INTO  FDTD 


David  F.  Kelley*  and  Raymond  J.  Luebbers 
Department  of  Electrical  Engineering 
The  Pennsylvania  State  University 
University  Park,  Pennsylvania  16802 


Abstract  -  The  recursive  convolution  (RC)  method,  developed  by  Luebbers,  et  al.{\\,  [2],  provides  a  fast  and  efficient 
procedure  for  treating  dispersive  media  in  the  finite-difference  time-domain  (FDTD)  method.  However,  the  method  is 
somewhat  less  accurate  than  other  formulations.  A  new  piecewise  linear  recursive  convolution  (PLRC)  method  is  described 
here  that  has  greatly  improved  accuracy  over  the  original  RC  approach  but  that  retains  the  speed  and  efficiency  advantages 
of  the  original  method.  Example  calculations  are  presented  to  demonstrate  the  improvement  in  accuracy  achieved  by  the 
PLRC  me'thod. 


L  INTRODUCTION 

Several  approaches  have  emerged  to  incorporate  the  treatment  of  linear  dispersive,  or 
frequency-dependent,  media  into  the  standard  FDTD  algorithm  of  Yee  [3].  In  the  approach  taken  by 
Luebbers,  et  al  [1],  [2],  the  convolution  of  the  susceptibility  function  and  the  electric  field  in  the  time 
domain  is  expressed  in  discrete  ,time  form.  Because  of  the  exponential  nature  of  the  susceptibility 
function  for  most  materials  of  interest,  the  convolution  can  be  performed  efficiently  using  recursion, 
and  the  method  is  therefore  known  as  the  recursive  convolution  (RC)  approach.  In  the  convolution 
integral,  the  electric  field  is  assumed  to  be  piecewise  constant  over  each  time  step.  In  effect,  the 
electric  Vield  is  approximated  as  a  staircase  function  in  time,  resulting  in  a  method  that,  though 
accurate  enough  for  most  practical  cases,  nevertheless  exhibits  less  accuracy  than  other  existing 
approaches.  A  new  method  is  presented  here,  however,  that  approximates  the  electric  field  within  the 
convolution  integral  using  a  piecewise  linear  function.  The  resulting  piecewise  linear  recursive 
convolution  (PLRC)  method  achieves  accuracy  that  rivals  or  exceeds  that  of  other  methods  but  retains 
most  of  the  speed  and  efficiency  of  the  original  RC  method. 

Because  of  space  limitations,  this  paper  describes  the  PLRC  method  for  a  linear  dispersive 
medium  characterized  by  a  single  Lorentz  complex  conjugate  pole  pair  only.  The  formulations  for 
Debye  media  and  multiple  poles  follow  similar  procedures.  Also,  though  the  discussion  is  couched 
in  terms  of  one-dimensional  problems,  the  extension  to  multiple  dimensions  is  straightforward.  The 
new  method  is  compared  to  the  original  Lorentz  RC  method  of  Luebbers,  et  al  [2]. 

H,  GENERAL  FORMULATION  OF  THE  METHOD 

In  a  dispersive  medium,  the  time  domain  electric  flux  density  D{t)  is  related  to  the  electric  field 
E{t)  by  the  convolution 
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(1) 


D(t)  =  e^e^E(t)  +  | E{t-r)x{T)dT 

0 

where  is  the  permittivity  of  free  space,  eo„  is  the  dielectric  constant  of  the  medium  at  infinite 
frequency,  and  x(0  is  the  susceptibility  function  of  the  medium.  Using  the  notation  i!)”  =  D{n^t)  and 
£"  =  E{nM),  the  convolution  can  be  written  in  discrete  time  form  as 

nA/ 

D  "  =  e^e^E  "  +  /  E(nAt  -  r)  x(r)dr  (2) 

0 

In  the  RC  formulations  presented  in  [1]  and  [2],  the  electric  field  is  assumed  to  be  constant  over  each 
time  interval  Ar  as  shown  in  Fig.  1.  Greater  accuracy  can  be  achieved,  however,  if  the  electric  field 
is  assumed  to  vary  in  a  piecewise  linear  manner.  Referring  to  Fig.  1,  a  piecewise  linear  approximation 
to  the  electric  field  E{t)  over  a  given  interval  \iAt,  (/+l)Ar]  can  be  expressed  in  terms  of  the  discrete 
time  values  E  and  E"^^  as 


E{t)  =  E 


(t  -  iAt) 


(3) 


The  form  of  the  time-reversed  and  offset  electric  field  as  used  in  (2)  is  given  by 


E(nAt-T)  -  E"-'” 


(E 


:-l  _  pn-m\ 

- - - ^(r-mA?) 

At 


(4) 


Substitution  of  (4)  into  (2)  gives,  after  some  manipulation, 

n-1 

m~0 


where 

(m+l)At 

T  =  /  X(T)<ir  (6) 

mAl 

and 


(m+l)A/ 

^  f  -  mEt)x{T)dr 


(7) 
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Substituting  the  expression  for  IT  evaluated  at  times  t  =  and  t  -  (n+l)A?  into  the  discrete  time 
form  of  Ampere’s  Law, 

ctE'’"* 


AtVX//''*“  =  6„6, 


^(E-i  -£  "j  +  +  6,  E  X'"  +  ^ 


WJ  =0 


After  much  algebraic  manipulation,  the  general  form  of  the  FDTD  update  equation  for  the  electric 
field  becomes 


1 ^  Ax'" +(£"■'"'' 

^  „0  _  rO  m=0 


LMf  n  r  I 

+ - +  X  -  S 

e 


^  and  Ar  = 

From  this  point  on,  the  derivation  depends  on  whether  the  dispersive  medium  under  consideration  is 
of  Debye  or  Lorentz  type.  Though  only  the  derivation  for  a  Lorentz  medium  is  shown  here,  the 
derivation  for  a  Debye  medium  follows  a  similar  procedure. 

m.  FORMULATION  FOR  LORENTZ  MEDIA 

The  frequency  domain  susceptibility  function  of  a  Lorentz  medium  is  characterized  by  one  or 
more  pairs  of  complex  conjugate  poles.  For  a  medium  characterized  by  a  single  pole  pair,  the 
susceptibility  function  can  be  written  as 


where  e,  is  the  static 


ic,  or  zero-frequency,  dielectric  constant,  e„  is  the  infinite-frequency  dielectric 
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constant,  is  the  resonant  frequency  of  the  medium,  5  is  the  damping  coefficient,  and  j  =  The 
exp(/a;r)  time  convention  is  assumed.  It  is  convenient  to  rewrite  (12)  in  the  form 


X(«)  = 


_ yP _ 

+/?“)  +  ;2coa  - 


(13) 


where 


a  =  S  ,  li  =  , 


and 


Y  == 


(14) 


The  Fourier  transform  of  (13)  gives  the  time-domain  susceptibility  function 

X(0  =  ye-‘“smlfit)u(t) 


(15) 


The  susceptibility  function  in  the  form  of  (15)  does  not  allow  the  development  of  a  simple 
recursive  convolution  algorithm.  This  problem  was  solved  in  [2]  by  introducing  the  quantity 

U()  = 

where  the  carat  indicates  a  complex -valued  quantity.  The  quantity  x(0  has  the  property 


x(0  =  Re{)(:(0} 

Substitution  of  x(0  for  x(0  into  (6)  and  (7)  yields  the  complex  quantities 

_  ~y  Y  h 
a  -iP 


(17) 


(18) 


and 


pm  ^  -jy  L(-a*il})At 


- —  -a+jp 
At 


At 


(19) 


The  quantities  in  (18)  and  (19)  have  the  properties 


and 


(20) 


and  it  is  easy  to  show  that  they  are  related  to  the  real  quantities  x"*  and  I""  by 
X'"  =  Re{r}  and  ^  =  Re{r} 


(21) 
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Likewise,  it  is  easy  to  show  that  the  quantities 


and  -  T 


have  the  properties 


and  are  related  to  the  real-valued  quantities  Ax'"  and  A^"*  by 

Ax'"  =  Re(Ar}  and  =  Re{Ar} 

rv,  THE  RECURSIVE  ACCUMULATOR 

The  recursive  convolution  algorithm  calculates  the  summation  in  (10)  at  each  time  step  in  a 
computationally  efficient  manner.  In  order  to  derive  the  algorithm,  let 

n-\.  /‘^c\ 

41"  =  j]{£;"-'"Ax'"+ 


The  quantity  i"  is  called  the  recursive  accumulator.  With  the  introduction  of  i/',  the  electric  field 
Update  equation  in  (10)  becomes’ 


A  complex  recursive  accumulator  can  be  defined  for  Lorentz  media  as 

n-l 

r  =  5^  {e'’-'" AX'"  +  -£""")  AH 


Note  that,  using  (24), 
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i|j"  =  Rejilf”} 


(28) 


Equation  (27)  can  be  rewritten  as 

n-2 

m=0 

The  summation  in  (29)  is  equivalent  to  with  the  terms  Ax'”  and  Af"*  replaced  by  Ax"*^*  and 
A$"*+i,  respectively.  Substituting  (23)  into  (29)  leads  to  the  recursion  relation 

ijr"  =  (AX®-A^°)£:”  + + 


At  each  time  step  during  the  execution  of  the  PLRC  algorithm  for  Lorentz  dispersive  media, 
the  complex  recursive  accumulator  is  updated  using  (30).  The  electric  field  is  then  updated  using  (26), 
incorporating  the  new  recursive  accumulator  value  found  via  (28).  The  values  of  x°  and  that  are 
required  in  the  electric  field  update  equation  (26)  are  found  by  setting  m  =  0  in  (18),  (19)  and  (21). 

The  recursive  accumulator  update  equation  (30)  is  very  similar  to  equation  (20)  in  [2].  The 
PLRC  method  for  Lorentz  media  requires  only  one  additional  real  backstore  variable  (the  electric  field 
value  at  time  (K-l)Ar)  over  those  required  for  the  original  RC  method,  regardless  of  the  number  of 
poles.  However,  the  PLRC  method  achieves  much  greater  accuracy  than  the  original  method  with  only 
a  minor  sacrifice  in  program  speed  and  memory  requirements. 

V.  PTOMERICAL  RESULTS 

To  demonstrate  the  improvement  in  accuracy  achieved  by  the  PLRC  method  over  the  original 
RC  method,  both  methods  were  applied  to  the  problem  of  finding  the  reflection  coefficient  as  a 
function  of  frequency  at  the  interface  between  free  space  and  a  dispersive  half-space  for  normal 
incidence.  The  dispersive  half  space  used  in  the  example  is  a  Lorentz  medium  characterized  by  a 
single  pair  of  complex  conjugate 'poles  with  parameters  =  3.0,  =  1.8,  =  27r  x  20  GHz  and 

5  =  0.  loj^.  The  conductivity  a  is  zero.  The  incident  pulse  used  in  the  FDTD  solution  is  Gaussian  with 
a  width  of  33  cells  at  the  0.001-amplitude  points  and  contains  significant  energy  at  frequencies  up  to 
approximately  100  GHz.  Since  this  is  a  1-D  problem,  simple  first  order  Mur  absorbing  boundary 
conditions  [4]  were  used  at  the  edges  of  the  computation  space. 

The  FDTD  solution  used  a  cell  size  of  250  fim  and  a  time  step  size  of  0. 125  ps  (the  Courant 
limit).  The  FDTD  space  was  450  cells  wide,  and  the  interface  was  located  45  cells  from  the  "incident” 
edge.  A  total  field  FDTD  formulation  was  used.  The  reflection  coefficient  was  calculated  by  taking 
the  FFT  of  the  reflected  pulse  on  the  ”air”  side  of  the  interface  and  the  FFT  of  the  incident  pulse  at 
the  same  location.  Dividing  each  frequency  component  of  the  FFT  of  the  reflected  pulse  by  the 
corresponding  component  of  the  FFT  of  the  incident  pulse  gives  the  reflection  coefficient  as  a  function 
of  frequency.  The  FDTD  algorithm  was  allowed  to  run  for  825  time  steps  to  ensure  that  all  transients 
dissipated  before  the  FFTs  were  performed. 
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The  reflection  coefficient  magnitudes  at  the  interface  computed  using  the  PLRC  method  and 
the  original  RC  method  are  compared  to  the  analytical  solution  in  Fig.  2.  The  RC  solution  strays 
significantly  from  the  analytical  solution  near  the  resonance  peak  and  around  40  GHz.  The  PLRC 
solution  however,  follows  the  analytical  curve  closely  at  all  frequencies.  The  error  of  the  computed 
reflection  coefficient  for  the  PLRC  and  RC  methods  is  compared  in  Fig.  3,  In  the  resonance  region, 
the  error  obtained  using  the  PLRC  method  is  an  order  of  magnitude  less  than  that  obtained  using  the 
RC  method.  Both  error  curves  diverge  from  zero  at  the  higher  frequencies  where  the  spatial  samp  mg 
begins  to  become  inadequate. 


VI.  CONCLUSIONS 

The  PLRC  approach  described  here  for  analyzing  propagation  through  linear  dispersive  media 
achieves  greatly  improved  accuracy  over  that  of  the  original  RC  method  of  Luebbers,  et  a/.,  yet 
retains  excellent  speed  and  computational  efficiency.  The  number  of  real  storage  variables  required 
beyond  the  standard  (non-dispersive)  FDTD  algorithm  to  implement  the  PLRC  method  is  only  2F+1 
for  Lorentz  media,  where  P  is  the  number  of  complex  pole  pairs  in  the  susceptibility  function, 
compared  to  the  2P  real  variables  required  by  the  original  method.  The  PLRC  approach  also  requires 
only  one  extra  complex  multiplication  and  one  extra  complex  addition  per  pole  per  cell  per  dimension 
beyond  that  required  for  the  original  RC  method. 
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Fig.  1.  Piecewise  linear  (solid)  and  staircase  (dotted)  approximations  to  electric  field. 
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REFLECTION  COEFFICIENT  AT  INTERFACE 
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Fig.  5,  Reflection  coefficient  magnitude  at  interface  computed  using  the  PLRC  method  and  the 

RC  method  compared  to  the  analytical  solution.  The  FDTD  cell  size  is  250  ^m. 


ERROR  OF  COMPUTED  REFLECTION  COEFFICIENT 


Fig.  6.  Error  of  computed  reflection  coefficient  magnitude  obtained  using  the  PLRC  method 

compared  to  that  obtained  using  the  RC  method.  The  cell  size  is  250  /urn. 
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1  Introduction 

Typically,  in  explicit  time  domain  electromagnetic  simulators,  the  problem  space  is  discretized  in  a 
selected  orthogonal  coordinate  system,  usually  a  rectangular  one.  Since  there  are  apparent  limita¬ 
tions  of  such  meshes,  other  solutions  have  been  previously  developed,  e.g.  unstructured  meshes[l] 
and  numerically  generated  orthogonal,  conformal  meshes [2].  While  very  elegant  and  allowing  for 
a  coarser  discretization,  these  solutions  are  not  easy  to  implement,  require  extensive  data  prepro¬ 
cessing  and  introduce  significant  computational  overhead  associated  with  the  complicated  nature 
of  the  mesh. 

In  various  electromagnetic  problems,  for  instance  those  involving  antennas  or  waveguide  discon¬ 
tinuities,  the  geometry  of  the  elements(substructures)  is  simple  and  can  be  readily  described  in 
one  of  the  common  orthogonal  coordinate  systems.  More  efficient  mesh  schemes  can  be  accom¬ 
plished  in  such  cases.  For  example,  in  [6]  a  technique  was  described  and  applied  to  solve  Maxwell 
equations  using  a  classical  FDTD  scheme,  where  local  simple  meshes  (cylindrical,  spherical  or 
rectangular)  conforming  to  the  structure  boundaries  were  introduced  and  interfaces  were  created 
to  ’’stitch”  them  with  the  global  mesh  of  the  problem.  A  similar  approach  was  described  in  [4] 
for  solving  Maxwell  equations  using  Lax-Wendorff  explicit  scheme. 
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The  interfacing  of  different  grids  involves  a  few  numerical  problems.  In  order  to  transfer  the  data 
from  one  grid  to  another  it  is  necessary  to  interpolate  the  fields  obtained  in  one  coordinate  system 
to  the  points  located  at  mesh  nodes  in  the  second  grid.  Yee  at  al.  [6]  used  linear  interpolation 
for  both  electric  and  magnetic  fields.  They  observed  that  when  interfacing  rectangular  grid  with 
a  curvilinear  one,  instabilities  occur  if  the  grids  are  too  loosely  coupled.  As  a  remedy  to  this 
problem  some  overlapping  of  two  grids  was  recommended  for  the  magnetic  field  in  the  curvilinear 
grid,  using  an  average  of  values  obtained  through  interpolation  from  rectangular  grid  and  the 
curvilinear  FDTD  algorithm.  This  approach  implies  that  in  fact  the  simulation  is  done  twice 
within  the  overlap  zone.  Additionally,  since  the  structure  investigated  in  [6]  was  open,  it  was 
imposible  to  infer  if  any  late  time  instabilities,  sometimes  occuring  in  finite  difference  scheme, 
actually  occured  when  two  grids  were  used. 

Since  in  theory  it  suffices  to  pass  only  the  boundary  values  from  one  grid  to  the  other  we  decided 
to  investigate  whether  a  stable  and  accurate  interfacing  can  be  done  with  minimal  overlap,  ie. 
with  an  overlap  zone  only  one  mesh  layer  deep.  The  minimal  overlap  is  important,  especially  for 
modern  latest  multi  processor  workstations  in  which  the  computations  for  two  grids  can  be  carried 
out  in  parallel  by  different  CPUs.  An  increase  of  the  overlap  between  grids  increases  the  amount 
of  information  exchanged  between  the  processors  which  in  turn  degrades  the  overall  performance 
of  parallelized  computations. 


2  Formulation 

In  brief  the  procedure  of  interfacing  of  two  grids  is  as  follows: 

•  A  global  grid  is  selected. 

•  an  auxiliary  grid(s)  is(are)  selected  which  conforms  to  the  selected  element(s)  of  the  struc¬ 
ture.  The  boundary  between  the  grids  is  determined. 

•  within  each  grid  the  nodes  are  either: 

—  internal  —  if  all  its  neighbors  are  also  internal 

—  boundary  —  if  at  least  one  of  its  neighbors  lies  in  the  complementary  mesh. 

•  if  the  element  is  internal,  a  standard  leap-frog  algorithm  is  used  to  advance  time. 

•  if  the  element  is  the  boundary  one,  an  interpolation  is  used  to  compute  the  field  quantity 
from  the  surrounding  four  nodes  of  the  complementary  mesh  (Fig.l).  This  interpolation 
scheme  creates  a  link  between  the  two  meshes. 

It  is  assumed  that  the  boundary  is  situated  in  a  homogeneous  region  of  the  structure,  ensuring 
proper  differential  properties  of  the  fields  and  thus  the  accuracy  of  the  interpolation. 

To  implement  this  procedure,  it  suffices  to  overlap  the  two  grids  by  one  mesh  layer.  In  order 
verify  if  this  minimal  overlap  works  in  practice,  we  have  examined  a  canonical  2D  problem  of 
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TM  field  propagation  in  a  cylindrical  waveguide  with  perfectly  conducting  walls  and  normalized 
radius  R  =  1.  This  choice  of  the  model  problem  has  two  important  advantages.  First,  the 
analytical  solution  is  known,  and  second,  since  a  shielded  problem  is  considered,  no  energy  is 
radiated  (dissipated)  from  the  structure  allowing  for  the  occurence  of  late  time  instabilities  to  be 
easily  detected  and  studied. 

The  rectangular  grid  is  selected  in  the  center  of  guide,  while  the  cylindrical  one,  conforming  to 
the  shape  of  boundary,  is  employed  outside  the  central  region.  The  interface  between  grids  is 
implemented  for  r  =  rjnt  <  F 

The  wave  propagation  inside  the  guide  can  be  expressed  by  the  following  initial  value  problem 

=  0  (1) 

=  0  (2) 

(l){r,ip,0)  =  (^) 

where  (j)  is  the  scalar  potential  for  TM  modes  and  c  is  the  velocity  of  light. 

The  above  equations  involve  only  one  scalar  potential  instead  of  usual  5  field  components  required 
for  classical  FDTD  formulation  using  Yee’s  mesh  for  TM  modes.  The  formulation  based  on  a  scalar 
wave  equation  is  particularly  advantageous  for  interfacing  grids. 

To  ensure  a  smooth  transition,  grids  should  be  seamed  in  a  homogeneous  region  of  the  structure. 
It  was  shown,  however  [5],  that  in  homogeneous  regions  it  is  possible  to  replace  the  usual  vector 
formulation  of  the  FDTD  scheme  with  an  equivalent  version  based  on  two  scalar  wave  equations 
for  TE  and  TM  modes.  Thus  an  algorithm  based  on  two  scalars  can  be  devised  rather  than  on  6 
field  quantities  leading  to  a  simplified  and  hopefully  more  efficient  process  of  interfacing  of  meshes. 


3  Interpolation  methods 


There  are  a  few  algorithms  which  can  be  used  for  interpolating  scalar  potential  from  one  grid  to 
another.  The  simplest  one  is  the  bilinear  interpolation.  This  algorithm  requires  only  knowledge 
of  the  potential  values  at  four  inner  nodes  surrounding  the  boundary  node.  But  the  bilinear 
interpolation  does  not  guarantee  the  continuity  of  the  derivatives.  By  using  more  than  four  inner 
nodes  it  is  possible  to  use  higher  order  interpolation  schemes.  Since  the  interface  is  at  r  ~  nnt  it 
is  easier  to  implement  the  higher  order  schemes  when  passing  from  cylindrical  grid  to  rectangular 
one.  For  numerical  tests  we  implemented  the  following  algorithms. 

For  the  interpolation  from  rectangular  grid  to  cylindrical  one: 

1.  bilinear  (BL) 

2.  bicubic  (BC) 

For  the  interpolation  from  cylindrical  grid  to  rectangular  one: 
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1.  bilinear  (BL) 

2.  bicubic  (BC) 

3.  bicubic  splines  (BCS) 

4.  splines  under  tension  (SUT) 

The  bicubic  interpolation  involves  only  the  information  on  the  potential  and  its  derivatives  at  four 
nearest  neighbors.  For  calculating  splines  we  used  potential  values  calculated  at  all  points  located 
in  the  ring  containing  the  boundary  points  of  the  rectangular  grid.  For  the  bicubic  interpolation 
and  bicubic  splines  we  used  the  procedures  from  [7],  while  for  the  splines  under  tension  we  applied 
the  subroutines  from  the  FITPACK  library  [8]. 


4  Numerical  results 

For  the  time  domain  solution  of  equation  (1)  the  leapfrog  explicit  scheme  was  used.  The  numerical 
simulations  were  carried  out  for  the  interface  between  grids  located  at  r  =  nnt  =  0.4  and  r  = 
=  0.6  In  both  cases  the  cylindrical  grid  was  assumed  to  have  30  by  62  points  in  the  radial  and 
angular  direction,  respectively.  The  rectangular  grid  was  assumed  to  be  based  on  square  cells  but 
the  discretization  step  was  different  for  r  —  0.4  and  r  =  0.6.  It  was  selected  to  match  the  size  of 
a  rectangular  cell  with  the  cylindrical  one  at  the  interface. 

The  errors  relative  to  analytical  results  are  shown  for  different  interpolation  schemes  in  Tables  1 
and  2.  In  all  cases  8000  iteration  steps  were  performed. 


Table  1:  The  errors  relative  to  analytical  results  and  CPU  time  for  different  interpolation  schemes 
and  interface  between  grids  at  rint  =  0.4 


Interpolation  scheme 

Relative  error 

Normalized  CPU 

From  rect'from  cyl 

TMoi 

mi 

TMji 

T  Mq2 

BC-BC 

-0.014 

-0.0027 

-0.012 

-0.0009 

1 

BL-BCS 

-0.012 

-0.0024 

-0.012 

-0.0009 

0.52 

BL-SUT 

gililiM 

BL-BL 

1  -0-011 

-0.001 

-0.013 

-0.0023 

0.43 

In  the  Tables,  the  codes  the  first  column  describe  the  scheme  used  for  interpolation  from  the 
rectangular  to  cylindrical  grid  and  vice  versa.  The  codes  are  explained  in  the  preceding  section. 
The  last  column  in  Table  1  shows  the  total  CPU  time  normalized  to  the  CPU  time  required  by 
BC-BC  interpolation  i.e.  bicubic  interpolation.  It  can  be  seen  that  low  order  bilinear  interpolation 
scheme  is  much  faster  than  the  higher  order  bicubic  interpolation.  Since  BL-BL  may  be  expected 
to  be  the  least  accurate,  a  good  compromise  between  speed  and  interpolation  accuracy  may  be 
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Table  2:  The  errors  relative  to  analytical  results  for  different  interpolation  schemes  and  interface 
between  grids  at  rjnt  =  0.6 


Interpolation  scheme 

Relative  error  | 

From  rect-from  cyl 

TMoi 

TMix 

TM21 

TMq2 

BL-BCS 

-0.0082 

0.0035 

-0.0038 

0.001 

BL-SUT 

-0.009 

0.0027 

-0.004 

0.001 

BL-BL 

-0.0086 

0.0048  ' 

-0.0035 

0.002 

achieved  by  combining  the  bilinear  interpolation  for  passing  from  rectangular  to  cylindrical  grid 
with  splines  for  interpolating  in  the  other  direction. 

To  investigate  the  difference  between  the  interpolation  schemes  we  have  plotted  in  Fig.2  the  local 
relative  difference  between  time  samples  obtained  from  BL-BL  and  BC-BC  scheme.  It  can  be 
seen  that  the  local  difference  at  some  instances  is  quite  significant  and  generally  increases  as  the 
simulation  progresses.  This  local  error  does  not,  however,  influence  the  results  of  the  spectral 
analysis. 


5  Early  and  late  time  instabilities 

In  the  preceding  section  we  showed  that  accurate  results  can  be  obtained  with  minimal  mesh  over¬ 
lap  for  realistic  simulation  times  (8000  samples).  In  order  to  evaluate  instabilities,  we  investigated 
the  case  when  there  is  a  mismatch  between  mesh  cells.  This  situation  takes  place  for  instance  when 
Tint  =  0.2  and  the  grid  step  in  the  angular  direction  is  increase  two  times  in  the  cylindrical  grid. 
Fig. 3  shows  the  the  time  record  of  the  samples  taken  every  100th  iteration.  It  can  be  noted,  that 
instabilities  develop  after  2000th  iteration.  The  instability  occurs  for  all  interpolation  schemes 
which  we  tested.  It  was  interesting  to  observe  that  improving  the  approximation  order  does  not 
postpone  the  moment  the  instability  starts  to  develop.  This  may  indicate  that  the  reason  for  the 
instabilities  may  be  not  the  approximation  error,  but  different  numerical  dispersion  properties  of 
the  grids  that  become  apparent  when  cells  do  not  match. 

Having  discovered  the  early  time  instability  and  their  immunity  to  the  interpolation  scheme  we 
decided  to  look  back  at  the  examples  discussed  in  the  previous  section  but  run  this  time  the 
simulation  much  longer.  Fig.4  shows  the  results  for  24000  samples  and  =  0.6  using  BL-BL 
interpolation.  It  is  seen  that  no  signs  of  instabilities  occur  for  the  first  20000  time  steps  but  after 
that  instability  develop  slowly.  At  this  time  we  are  not  certain  if  this  late  time  instability  can 
be  attributed  to  the  accumulation  of  approximation  errors  or  different  dispersion  properties.  We 
expect  to  address  this  problem  in  the  future  research. 
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6  Conclusions 


We  have  demonstrated  that  interfacing  of  the  grids  of  different  type  is  possible  even  with  minimal 
overlap  provided  the  mesh  cell  sizes  are  matched  at  the  interface.  Although  the  interface  works 
well  for  realistic  simulation  times,  very  late  time  instabilities  may  develop  in  extremely  long 
simulations. 


7  Acknowledgment 


This  work  has  been  performed  during  one  of  the  authors  (MM)  visit  at  the  University  of  Victo¬ 
ria.  Grants  from  the  Natural  Sciences  and  Engineering  Research  Council  of  Canada,  B.C. Hydro, 
TransAlta  Util.,  and  TR  Labs  are  greatfully  acknowledged. 


References 

[1]  N.K.  Madsen,  R.W.Ziolkowski,  ’’Numerical  solution  of  Maxwell’s  equations  in  time  domain 
using  irregular  non-orthogonal  grids,”  Wave  Motion,  vol.lO,  ..583-596,  Dec,  1988 

[2]  M. Fusco,  “FDTD  Algorithm  in  Curvlinear  Coordinates”,  IEEE  Trans.  Antennas  Propag.^ 
vol  AP-38,  No.l,  pp76-89,  Jan  1990. 

[3]  K.S.  Kunz,  R.J.  Luebbers,  ^‘Finite  Difference  Time  Domain  Method  for  Electromagnetics^ ^ 
CRC  Press  Inc.,  1993. 

[4]  M-D. White  J.J.Chattot,  ’’Investigation  of  the  Use  of  Chimera  for  Solving  Maxwell’s  Equa¬ 
tions  for  Electromagnetic  Propagation  in  the  Time  Domain”,  Proc.  9th  Annual  Rev.  Applied 
Computational  Electromagnetics,  pp352-359,  Monterey,  March  22-26,  1993. 

[5]  P.Aoyagi,  J.F.Lee  and  R.Mittra,  ”A  hybrid  Yee  algorithm /scalar- wave  equation  approach,  ” 
IEEE  Trans.  Microwave  Theory  Tech.,  vol  MTT-41,  pp.  1593-1600,  Sept.  1993. 

[6]  K.S. Yee.,  J.S.Chen,  A. H. Chang,  ’’Conformal  Finite  Difference  Time  Domain  (FDTD)  with 
Overlapping  Grids,”  IEEE  Trans.  Antennas  Propag.,  vol  AP-40,  No.9,  ppl068-1075,  Sept. 
1992. 

[7]  W.H.  Press,  B.P.  Flannery,  S.A.  Teukolsky  and  W.T.  Vetterling,  Numerical  recipes:  The  art 
of  scientific  computing,  Cambridge  University  Press,  Cambridge,  1986. 

[8]  Netlib  software  library  avaliable  via  e-mail  (netlib@ornl.gov) 


539 


1  ' 
1 

-1  ... 

J 

1 

/ 

^  Cylindrical 

\  / 

1 

1 

/ 

/ 

/ 

/\ 

/  \ 

\ 

f - ; 

-  ."i 

\  1 

\  ^  ^ ' 

\ 

\  \ 

\  \  , 

1 

\ 

r 

1 

f 

1  1 

L 

1 

1  1 
,  1  1 

Rectangular 


Figure  1:  An  interface  between  a  rectangular  and  cylindrical  meshes.  Rectangles  represent  nodes  of 
rectangular  mesh,  circles  of  cylindrical  one.  Filled  symbols  depicts  the  interpolation  of  cylindrical 
nodes,  while  open  symbols  explain  the  interpolation  of  nodes  of  the  rectangular  mesh. 


Figure  2:  The  time  record  of  a  local  error  between  BL-BL  and  BC-BC  interpolation  (interface  at 
Tjnt  =  0.4).  Only  every  100th  sample  is  shown 
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Figure  3:  The  time  record  for  interface  at  rjnt  =  0.2  and  unmatched  cells  in  both  grids.  Only 
every  100th  sample  is  shown 


Figure  4:  The  time  record  for  interface  at  rint  =  0.4  and  BL-BL  interpolation  between  grids  with 
matched  cell  size  in  both  grids.  Only  every  100th  sample  is  shown 


Time  Domain  Response  of  Simulated  2D  Composite  Scatterers 
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Two-dimensional  composite  objects  can  be  simulated  by  dividing  the  object  into  small  cells  and  then 
replacing  each  cell  with  a  homogeneous  circular  cylinder.  This  paper  investigates  the  ability  of  the  simulation 
technique  to  calculate  the  time  domain  response  of  composite  scatterers.  In  this  work,  the  transient  plane- 
wave  incident  on  the  scatterer  has  a  center  frequency  and  is  band-limited.  The  time  domain  scattering  data  is 
generated  using  an  appropriate  inverse  fast  Fourier  transform  of  the  time-harmonic  far-field  scattering  data 
determined  at  discrete  frequencies.  Results  of  the  simulation  method  are  compared  with  moment  method 
results. 

Introduction  ....  ut  vp  u 

Calculating  the  field  scattered  from  two-dimensional  composite  objects  is  a  diflScult  problem  if  the 

inhomogeneity  does  not  follow  any  particular  spatial  pattern.  However,  if  the  scattering  object  is  discretized 
into  small  cells  and  if  each  ceU  is  simulated  by  a  homogeneous  circular  cylinder,  efficient  solutions  are 
available  to  predict  the  scattering  width  of  the  composite  structure  [1].  The  validity  of  the  simulation  of  two 
dimensional  composite  scatterers  by  small  circular  cylinders  has  been  proved  through  the  years  for  a  TM 
polarized  wave  [2,  3].  In  recent  publications  [4,  5],  a  successful  simulation  for  TE  polarized  waves  is 
presented.  The  main  advantage  of  using  circular  cylinders  for  simulation  is  that  the  cylinders  are  canonical 
objects  and  an  exact  series  solution  for  each  type  of  cylinder  is  easily  obtainable.  The  scattering  from  the 
composite  scatterer,  however,  incorporates  the  exact  solution  of  each  cylinder  and  the  infinite  multiple 
interaction  between  them.  The  solution  is  theoretically  exact  and  the  only  approximation  involved  for 
obtaining  numerical  data  is  due  to  the  truncation  of  the  interaction  matrix.  This  type  of  approximation  is 
similar  to  those  used  in  many  other  numerical  techniques. 

Although  the  simulation  technique  works  very  well  for  calculating  time-harmonic  far  and  near  fields, 
we  are  not  aware  of  any  attempt  to  use  the  simulation  of  composite  scatterers  with  cylinders  for  time  domain 
analysis.  The  objective  of  this  paper  is  to  investigate  the  validity  of  the  simulation  technique  for  calculating 
the  time  domain  response  of  composite  scatterers.  In  this  work,  the  transient  plane-wave  incident  on  the 
scatterer  has  a  center  frequency  and  is  band-limited.  The  time  domain  scattering  data  is  generated  using  an 
appropriate  inverse  fast  Fourier  transform  of  the  time-harmonic  far-field  scattering  data  determined  at 
discrete  frequencies  [6]. 

Here  we  investigate  the  ability  of  the  simulation  technique  to  predict  the  transient 
electromagnetic  wave  returned  from  structures  with  local  as  well  as  non-local  and  resonant  scattering 
behavior.  For  this  study,  two-dimensional  (2D)  conducting  bodies  in  the  upper  resonance  region  were 
chosen  because  they  are  the  simplest  targets  that  demonstrate  all  of  the  desired  scattering  behavior. 
Scattering  bodies  considered  here  are  a  flat  plate  and  a  rectangular  cavity.  Numerical  comparison 
between  the  data  from  the  simulation  technique  and  the  moment  method  (MM)  is  presented.  Using  tWs 
comparison,  we  show  the  advantages  and  disadvantages  of  the  simulation  technique  for  predicting  the  time 
domain  response  of  these  two  scatterers.  Note  that  the  results  presented  here  are  for  perfect  electric 
conductor  (PEC)  scatterers  only.  This  is  because  of  the  large  execution  time  required  for  obtaining  reference 
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results  for  composite  scatterers  using  the  MM.  However,  we  believe  that  the  real  advantage  of  the  simulation 
method  is  its  ability  to  calculate  the  transient  scattering  from  inhomogeneous  bodies  rapidly  when  compared 
to  other  methods  [1]. 

Technical  Discussion 

A  detailed  description  of  the  MM  solution  for  the  time-harmonic  electric  field  in  the  far  field  of  a 
2D  scatterer  is  given  in  [7],  A  similar  description  for  the  simulation  method  is  given  in  [1,  4]. 

Because  we  assume  that  the  reader  is  in  general  familiar  with  these  numerical  techniques,  we 
have  chosen  to  devote  this  section  to  describing  the  calculation  of  the  scatterers’  transient  response 
from  the  calculated  time-harmonic  electric  field.  The  definitions  and  descriptions  in  this  section  are 
presented  for  the  general  three  dimensional  case.  Definitions  are  then  specialized  to  the  2D  case  only  if 
they  are  different  from  the  general  expressions. 


The  Target  Signature 

For  a  particular  scatterer,  the  received  radar  signal  is  here  referred  to  as  the  target  signature. 
The  target  signature,  g(t),  represents  some  field,  voltage,  or  current  quantity  in  the  radar  receiver.  If  the 
scatterer  is  stationary  and  has  no  moving  parts,  the  target  signature  is  described  by  the  following 
equation  in  which  convolution  is  denoted  by  an  asterisk  (*): 

g(t)  =  aR(t)  *  ap(t)  *  h(t)  *  ap(t)  *  ap(t)  *  f(t)  (1) 

In  the  expression  above,  h(t)  is  the  target  impulse  response  function,  aT(t)  is  the  impulse  response  of 
the  transmitter  and  transmit  antenna,  aj^  (t)  is  the  impulse  response  of  the  receiver  and  receive  antenna, 
and  f(t)  is  the  transmitted  signal.  The  signal  f(t)  represents  some  field,  voltage,  or  current  quantity  in  the 
radar  transmitter.  In  (1),  ap(t)  accounts  for  propagation  of  the  signal  between  the  radar  and  the  target 
and  is  taken  to  be 

where  R  is  the  range  from  the  radar  to  the  target,  c  is  the  speed  of  light  and  5  is  the  Dirac  delta 
function. 

The  target  impulse  response,  the  impulse  response  of  the  transmitter,  and  the  impulse  response 
of  the  receiver  are  easiest  to  define  in  terms  of  their  Fourier  transforms,  H(Q,  Aj(f),  and  Aj.(f).  Here 
the  standard  engineering  definition  of  the  Fourier  transform  is  used  so  that  the  target  transfer  fiinction  is 
related  to  the  target  impulse  response  by  the  following  equation: 

H(f)  =  j  h(t)e-''^’^dt  (3) 

The  value  of  H(f)  at  a  particular  frequency  f  can  be  determined  by  considering  a  time-harmonic  plane- 
wave  incident  on  the  target  and  traveling  in  the  direction  of  the  unit  vector  k .  The  incident  electric  and 
magnetic  fields  written  in  phasor  form  (with  e^^^^  time  variation  assumed  and  suppressed)  are 

=  (4) 

and 

Hi(r)  =  (Ejno)(kxp)e-^2’**^''  (5) 
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where  is  the  constant  amplitude  of  the  incident  electric  field,  p  is  the  unit  polarization  vector  (here 
linear  polarization  is  assumed  for  simplicity),  and  is  the  characteristic  impedance  of  free  space. 
Quantities  in  boldface  type  are  vectors.  The  origin  of  the  coordinate  system  used  to  define  r  is  picked  at 
a  convenient  point  close  to,  inside,  or  on  the  scatterer.  The  origin  is  denoted  with  the  symbol  O  For  a 
monostatic  radar  system,  the  Fourier  transform  of  the  target  impulse  response  is  given  by  the  expression 

(6) 

E.(<9)  p  L  J 

In  the  expression  above  it  is  assumed  that  the  polarization  of  the  receive  antenna  is  matched  to  the 
polarization  of  the  transmit  antenna.  For  a  radar,  the  distance  R  from  the  origin  to  the  antennas  is 
assumed  to  be  electrically  large  so  that  the  scattered  electric  field  due  to  the  target.  Eg ,  in  (6)  can  be 
calculated  using  the  standard  far-field  approximations.  The  incident  electric  field  at  the  origin  in  (6)  can 
be  approximated  as  a  plane  wave  as  expressed  in  (4)  because  the  variation  of  the  incident  field  with  R  is 
unimportant  in  this  equation  (identical  variation  with  R  occurs  in  Eg  and  hence  cancels  the  variation  in 
the  incident  field)  and  because  the  spherical  wave  fi-onts  are  approximately  planar  at  the  target  in  the 
far-field  of  the  transmit  antenna.  For  a  2D  scatterer,  a  slightly  different  definition  of  H(f)  is  used.  For 
the  2D  case 


(7) 


where  p  is  the  distance  from  the  z-axis  in  the  cylindrical  coordinate  system. 


The  Equivalent  Transmitted  Signal 

To  obtain  a  more  convenient  form  for  analysis,  equation  (1)  can  be  rewritten  using  the 
commutative  property  of  convolution  to  obtain  the  following: 

g(t)  =  *  aR(t) * f(t  - 2R  /  c)l  ^ 

I  47rR^  J 

In  the  expression  above,  the  total  time  delay  is  accounted  for  in  f(t).  The  quantities  in  the  curly  brackets 
in  (8)  are  assumed  known  and  so  it  is  convenient  to  combine  them  and  define  them  as  a  single  quantity. 
This  quantity  is  here  referred  to  as  the  equivalent  transmitted  signal,  f ®(t) . 

rU')  -  -  2R  /  c) 

^  ^  4jcR^ 

Using  the  equivalent  transmitted  signal  the  target  signature  can  be  expressed  as 

g(t)  =  r(t)*h(t).  (10) 


The  form  of  (10)  is  useful  because  f®(t)  includes  the  effect  of  the  receive  antenna  and  receiver,  the 
transmit  antenna  and  transmitter,  and  the  range  to  the  target;  while  the  target  impulse  response  h(t) 
isolates  the  scattering  characteristics  of  the  target  (note  however  that  h(t)  is  also  a  function  of  the 
transmit  and  receive  polarization). 

Because  the  Fourier  transform  of  the  equivalent  transmitted  signal  is  band-limited,  it  is  only 


necessary  to  determine  H(f)  for  frequencies  in  the  pass  band  of  F  (f)  in  order  to  calculate  g(t).  To 
investigate  the  target  signature  any  suitable  band-limited  function  can  be  chosen  for  the  equivalent 
transmitted  signal.  To  aid  in  the  interpretation  of  the  time-domain  results,  a  standard  window  function  is 
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In  (11),  is  the  center  frequency  and  Af  is  the  bandwidth.  The  coefficients  ao,  aj,  and  a2  are  as 
follows:  ao  =  0.42,  ai  =  0.50,  and  a2  =  0.08. 

The  Radar  Profile 

Because  it  is  a  band-limited  function  the  target  signature,  g(t),  can  be  expressed  in  the  form 

g(t)  =  gi(t)cos(2jtf(,t)  +  g,(t)sin(25tfot)  (13) 

where  gi(t)  and  gq(t)  are  the  in-phase  and  quadrature  portions  of  g(t).  Equation  (13)  can  also  be  written 
in  envelope  and  phase  form. 

g(t)  =  p(t)cos{2rf„t  +  tan-^[g,(t)/gi(t)]}  (14) 

where 


Because  it  is  a  low-pass  limited  function,  the  envelope  of  g(t)  is  used  to  illustrate  the  scatterers  transient 
response  rather  than  g(t)  itself  The  envelope  of  the  target  signature  is  here  referred  to  as  the  radar 
profile  and  is  denoted  p(t). 

Haykin  introduces  the  complex  envelope  notation  to  make  band-pass  limited  signal  and  system 
problem  isomorphic  to  the  base-band  signal  and  system  problem  [8].  Using  the  complex  envelope 
notation  the  radar  profile  can  be  determined  in  a  straight-forward  manner  fi-om  F®(f)  and  H(f).  The 
complex  envelope  of  g(t)  is  denoted  as  g(t)  and  is  defined  as 

g(t)  =  gi(t)  +  jg,(t).  (16) 

The  function  g(t)  can  be  recovered  from  g(t)  using  the  following  expression: 

g(t)  =  Re{g(t)exp(j2rf„t)} .  (17) 

The  Fourier  transform  of  the  complex  envelope  can  be  determined  fi-om  the  Fourier  transform  of  g(t) 
using  the  equation 


G(f)  =  G(f  +  f sgn(f  +  fo)G(f  +  f J 


(18) 


where 


1  for  f  >  0 

sgn(f)  =  ‘  0  for  f  =  0 .  (19) 

-1  for  f  <  0 

In  an  identical  manner  the  complex  envelope  of  the  equivalent  transmitted  signal  can  be  defined  so  that 

f'(t)  =  f'(t)  +  jf‘(t)  (20) 

f‘(t)  =  Re|f'(t)exp(j27Efot)}  (21) 

and 

F'  (f )  =  F'  (f  +  f„  )  +  sgn(f  +  fo )  F'  (f  +  f„  )  (22) 

where  fi'(t)and  f' (t)  are  the  in-phase  and  quadrature  portions  off'(t).The  complex  envelope  of  the 
target  impulse  response  is  defined  slightly  different  manner  so  that 

h(t)  =  hi(t)  +  jh,(t)  (23) 

h(t)  =  Re|2  h(t)exp(j2rfot)}  =  2hj(t)cos(2jtfot)-  j2hq(t)sin(2rfot)  (24) 

and 

H(f )  =  [H(f  +  f„  )  +  sgn(f  +  fo  )  H(f  +  4  )]  /  2  (25) 

where  hi(t)  and  h<,(t)  are  the  in-phase  and  quadrature  portions  of  h(t).  The  complex  envelopes  are 
defined  so  that 

g(t)=h(t)*f  =  (t).  ^  ^  (26) 

According  to  (26),  the  complex  envelope  of  g(t)  can  be  expressed  in  terms  of  F'(f)  and  H(f)  so  that 

Af/2 

g(t)=  |F‘(f)H(f)exp(j2itft)df.  (27) 

-Af/2 

The  radar  profile  p(t)  is  the  magnitude  squared  of  (27).  For  the  Blackman  window 

rect(f /Af)  L  ^  j  +  £2  cos[47C  f/Af]l .  (28) 

Af  [  ao  ao  J 

The  quantity  H(f)  in  (27)  is  determined  from  the  positive  fi-equency  portion  of  H(f)  as  illustrated  in 
(25)  so  that 

H(f)  =  H(f  +  fJ.  (29) 

Here  the  radar  profile  is  calculated  fi-om  H(f)  using  (27),  (28),  and  (29).  The  Fourier  transform  in  (27) 
is  calculated  fi-om  the  values  of  H(f)  (determined  at  discrete  values  of  f  fi-om  the  time-harmonic 
electromagnetic  calculations)  using  the  complex  fast  Fourier  transform  (FFT). 

Results  and  Conclusions 

This  section  presents  the  radar  profile  of  a  number  of  2D  targets.  The  idea  of  a  reference  plane 
is  useful  in  interpreting  the  radar  profiles  given  here.  The  reference  plane  passes  through  the  origin  of 
the  scatterer  and  is  orthogonal  to  the  unit  normal  pointing  in  the  direction  of  the  incident  wave,  k .  A 

local  scatterer  on  the  target  will  result  in  a  peak  in  the  radar  profile  at  a  time  corresponding  to  twice  the 

distance  fi'om  the  reference  plane  to  the  scatterer  along  a  k  directed  path. 
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Figure  1  illustrates  the  envelope  of  the  equivalent  transmitted  signal,  f(t),  used  here.  For  this 
illustration  and  the  calculations  in  this  section  the  center  frequency,  4,  is  10  GHz  and  the  bandwidth, 
Af ,  is  4  GHz.  The  center  frequency  was  chosen  arbitrarily;  however,  the  bandwidth  was  chosen  so  that 
a  time  resolution  of  approximately  0.5  ns  is  achieved.  The  Blackman  window  has  a  first  time-domain 
side-lobe  which  is  -58  dB  below  the  peak  and  a  side-lobe  fall-off  rate  of  -18  dB/octave. 

Figure  2  illustrates  aim  wide  PEC  strip  with  a  TE  polarized  incident  plane  wave.  The  strip  is 
at  a  45  degree  angle  to  the  direction  of  the  incident  plane  wave  and  the  origin  is  1  m  from  the  center  of 
the  strip.  Figure  3  illustrates  the  radar  profile  of  the  PEC  strip.  The  first  two  return  pulses  in  Figure  3 
are  due  to  edge  diffraction  from  the  lower  and  upper  edges  of  the  strip.  The  latter  pulses  are  due  to 
waves  traveling  up  and  down  the  strip  that  are  then  diffracted  at  the  upper  or  lower  edge.  Figure  3 
demonstrates  that  the  simulation  technique  does  an  excellent  job  of  positioning  in  time  the  return  from 
edges  and  traveling  waves.  The  amplitudes  of  the  returns,  however,  differ  in  some  cases  from  the  MM 
results.  We  believe  that  these  differences  are  due  to  the  finite  thickness  of  the  strip  in  the  simulation 
technique.  In  the  simulation  technique,  the  1  m  strip  is  modeled  using  100  circular  cylinders  each  of 
radius  5  mm. 

Figure  4  illustrates  a  45  cm  by  10  cm  rectangular  cavity  with  a  plane  wave  incident  upon  the 
open  end  of  the  cavity.  The  center  of  the  open  face  of  the  cavity  is  1  m  from  the  origin.  The  cavity  is 
simulated  by  102  cylinders  each  of  radius  5  mm.  Figure  5  illustrates  the  radar  profile  of  the  rectangular 
cavity  for  a  TM  polarized  plane  wave.  The  first  return  pulse  in  Figure  5  is  due  to  edge  diffraction  from 
the  front  edge  of  the  cavity.  The  second  return  pulse  is  due  to  the  wave  which  travels  down  the  parallel- 
plate  waveguide  in  the  lowest  order  TM  mode  and  is  reflected  from  the  back  inside  face  of  the  cavity. 
The  third  return  pulse  appears  to  be  due  to  a  wave  which  travels  down  the  waveguide  in  a  fifth  order 
mode  and  is  reflected  from  the  back  face  of  the  cavity.  The  return  after  the  third  pulse  cannot  be 
classified  as  separate  return  pulses.  This  portion  of  the  radar  profile  is  the  resonant  response  of  the 
cavity. 
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Radar  profile  of  a  PEC  strip  with  a  TE  polarized  incident  wave. 
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Abstract 

Presently,  most  computer  code  development  in  the  Computational  Electromagnetics  (CEM) 
community  is  done  in  the  structured  programming  paradigm,  particularly  with  the  FORTRAN  language. 
Other  segments  of  the  programming  community  have  switched  to  the  Object  Oriented  Programming 
(OOP)  paradigm  in  recent  years  in  order  to  ease  the  development  of  highly  complex  codes.  This  paper 
examines  how  OOP  is  being  used  in  the  design  of  a  time-domain  numerical  analysis  CEM  code. 
Comparisons  will  be  made  between  the  OOP  code  and  a  structured  programming  paradigm  code  in  terms 
of  code  maintenance,  portability,  flexibility,  and  speed. 

Introduction 

There  has  been  a  great  deal  of  work  done  in  the  Electromagnetics  community  in  recent  years 
involving  the  development  of  numerical  algorithms  that  will  solve  EM  problems  quickly  and  accurately. 
The  vast  majority  of  these  codes  have  been  written  in  a  “conventional”  manner,  using  the  FORTRAN 
language  and  the  structured  programming  paradigm.  There  are  a  number  of  aspects  of  CEM  that  make 
the  task  of  writing  numerical  analysis  quite  challenging.  These  include: 

•  Parallelism 

•  Efficient  memory  usage 

•  Boundary  conditions 

•  Multiple  coordinate  systems 

•  Hybrid  methods  using  more  than  one  numeric  algorithm 

•  Handling  rectangular  structured,  conformal  structured,  and  unstructured  grids 

Object  Oriented  Programming  (OOP)  is  a  paradigm  that  has  been  used  to  replace  structured 
programming  for  the  production  of  very  large,  complex  computer  programs.  Object-oriented  languages 
such  as  C-I-+  have  rapidly  gained  acceptance  in  many  programming  communities.  In  some  areas  of 
numerical  analysis,  such  as  Computational  Fluid  Dynamics,  object-oriented  codes  have  made  an 
appearance  [1]. 


How  do  we  define  efficiency? 

A  goal  of  every  programmer  is  efficient  code.  In  the  traditional  programming  paradigm, 
efficiency  meant  speed  of  execution.  A  better  metric  might  be  the  life-cycle  costs  of  the  code.  In  addition 
to  execution  time,  life-cycle  costs  include  the  time  to  develop  the  application  code,  time  spent 
maintaining  it,  and  time  spent  building  the  input  file.  Money  also  must  be  factored  in;  the  developer  must 
know  whether  the  end  user  is  willing  to  pay  for  additional  speed  or  a  “friendlier”  interface.  There  are 
numerous  examples  of  situations  where  a  user  may  take  days  (or  longer)  to  build  a  model  that  can  be 
solved  in  hours  on  a  modem  supercomputer.  Using  the  old  metric,  spending  2  man-months  modifying  a 
code  so  that  it  would  run  50%  faster  made  sense.  Under  the  life-cycle  costs  metric  this  might  not  be  cost 
effective.  In  addition,  the  life-cycle  costs  metric  implies  that  a  slower-executing  code  that  is  easier  to 
maintain  may  be  more  efficient  than  a  faster  code  that  is  difficult  to  maintain.  This  metric  caused  the 
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move  away  from  assembly  languages  towards  high-level  languages  for  application  programs.  In  recent 
years,  this  same  metric  caused  many  software  engineers  to  move  from  the  structured  programming 
paradigm  to  the  OOP  paradigm. 


What  is  OOP? 

OOP  is  a  programming  paradigm;  a  conceptual  model  of  how  a  computer  program  should  look. 
The  structured  programming  paradigm  is  modeled  after  the  microprocessor.  A  series  of  “machines” 
called  procedures  or  subroutines  execute  operations  on  data.  In  structured  programming,  data  and 
functions  are  separate  entities.  Data  are  always  passive  (operated  upon)  and  functions  are  active. 

In  the  real  world  the  separation  between  data  and  functions  is  not  so  clear.  An  engineer,  for 
example,  is  a  person  with  a  name,  address,  etc.  An  engineer  also  has  skills  which  can  be  applied  to 
problems.  An  engineer  is  really  a  “thing”  with  both  attributes  (data)  and  methods  (functions).  In  the 
OOP  paradigm,  “things”  are  called  objects.  Each  object  has  attributes  and  methods  that  allow  it  to 
accurately  represent  a  specific  concept.  In  OOP,  data  takes  the  active  role  because  it  knows  what  to  do 
through  its  methods  [2].  The  OOP  paradigm  can  be  used  to  some  extent  in  any  language,  even 
FORTRAN  [3].  However,  hybrid  OOP  languages  such  as  C++  and  pure  OOP  languages  such  as 
Smalltalk  contain  language  constructs  that  allow  the  attributes  and  methods  of  objects  to  be  bound 
together. 


Encapsulation 

There  are  three  main  concepts  in  OOP  -  encapsulation,  inheritance,  and  polymorphism. 
Encapsulation  involves  the  the  way  attributes  and  methods  of  an  object  are  packaged  together.  With  foil 
encapsulation  the  attributes  of  an  object  are  accessible  only  to  other  objects  of  the  same  type.  An  object 
type  (in  C++  the  word  “class”  is  used)  defines  specific  attributes  and  methods  that  an  object 
possesses. When  encapsulation  is  enforced,  the  outside  world  must  access  the  values  of  an  object’s 
attributes  through  its  methods.  Pure  OOP  languages  require  full  encapsulation  while  hybrid  OOP 
lansuat^es  let  the  programmer  decide  the  level  of  encapsulation.  Encapsulation  allows  implementation 
details  of  the  high-level  concept  the  object  represents  to  be  hidden  from  the  outside  world.  The  object 
becomes  a  “black  box”,  limiting  the  possibility  for  error  by  overwriting  memory  or  applying  functions 
incorrectly.  An  end  result  is  more  readable  and  portable  application  code. 

Object  methods  may  be  divided  into  several  types.  First,  there  are  constructors  and  destructors. 
A  constructor  is  used  to  create  an  object,  similar  to  declaring  a  variable.  Memory  required  for  an  object  is 
allocated  as  the  object  is  created,  which  can  be  at  any  point  in  the  code.  The  destructor  deletes  the  object, 
freeing  up  memory  allocated  to  the  object.  NormaUy  destructors  are  called  automatically  when  objects  go 
out  of  scope.  For  example,  an  object  created  in  a  loop  goes  out  of  scope  at  the  exit  from  the  loop  and  at 
this  time  the  destructor  is  called.  This  dynamic  memory  allocation/deallocation  makes  memory 
management  much  easier  in  C+  than  it  would  be  in  FORTRAN.  Another  method  is  the  accessor,  which 
is  used  by  foreign  objects  to  access  values  of  data  inside  an  object.  Accessors  allow  the  object  creator  to 
give  specific  read/write  access  to  each  object  attribute.  Many  objects  contain  a  print  method.  Instead  of 
writing  an  external  function  that  outputs  an  object’s  attributes,  the  object’s  print  method  is  invoked, 
causing  the  object  to  print  itself. 


Inheritance 

OOP  languages  allow  creation  of  objects  that  are  “children”  of  another  object.  In  C++,  the  terms 
“base  class”  and  “subclass”  are  used.  Inheritance  is  the  ability  of  a  subclass  to  inherit  all  of  its  parent’s 
attributes  and  methods.  Usually  the  subclass  represents  a  specialization  of  the  base  class.  The  definition 
of  the  subclass  only  needs  to  include  those  methods  and  attributes  that  are  not  part  of  the  general  base 
class.  In  addition,  methods  that  are  diffemnt  in  the  subclass  than  in  the  base  class  may  be  redefined. 
Anything  that  doesn’t  change  need  not  be  included  in  the  subclass  definition. 

Inheritance  is  a  powerful  property  in  terms  of  code  reuse.  Objects  can  be  used  over  and  over  in 
new  applications,  either  as  is,  or  as  base  class  objects  for  some  new  specialized  object.  Inheritance  also 
allows  the  complexity  of  a  concept  to  be  spread  across  a  number  of  classes.  A  good  example  in  the 
present  code  is  the  Feed  Class,  which  represents  the  forcing  function  applied  to  the  EM  time-domain 
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boundary-value  problem.  The  base  Feed  Class  contains  all  the  information  that  is  common  to  all  forcing 
functions. The  primary  method  or  service  is  the  application  of  a  time-dependent  source  term  to  the  EM 
fields  in  a  problem  space.  The  nature  of  the  time-dependent  source  term  (e.g.  pulse  or  sinusoid)  is  an 
attribute  as  is  the  feed  region  definition  (the  region  that  the  forcing  function  is  applied  over).  There  are  a 
number  of  specializations  of  the  base  Feed  Class.  For  example,  the  weighting  across  the  feed  region  is 
different  for  a  coax  feed  than  for  a  rectangular  waveguide.  A  plane-wave  forcing  function  (used  in  a 
scattered  field  formulation)  has  a  very  different  implementation  from  a  fixed  region  feed  used  in  a  total 
field  formulation.  These  differences  are  contained  in  the  subclass  definitions. 

Polymorphism 

The  third  and  possibly  most  important  concept  in  the  OOP  paradigm  is  polymorphism, 
representing  an  abstract  concept  in  different  ways.  The  application  programmer  works  at  a  high-level, 
using  object  methods  to  accomplish  tasks  without  worrying  about  the  implementation  details  of  the 
methods.  If  different  forms  of  a  single  method  are  used  to  accomplish  the  same  task,  then  polymorphic 
functionality  is  present.  The  key  to  polymorphism  is  allowing  run-time  binding.  Objects  can  invoke 
methods  of  other  objects  without  knowing  the  type  (or  class)  of  the  other  object.  In  C++  this  is  done 
with  the  virtual  function.  While  C++  is  strongly  typed,  a  base  class  pointer  may  be  used  to  point  to  a 
sub-class  object.  When  a  virtual  function  is  invoked  through  a  base  class  pointer,  the  base  object 
method  is  not  called.  Instead,  the  object  method  of  the  object  the  pointer  points  to  is  called. 
Polymorphism  builds  upon  the  previous  two  concepts.  Encapsulation  ensures  that  the  outside  world 
works  with  an  object  method  rather  than  working  directly  upon  the  object’s  attributes.  Inheritance 
provides  for  subclasses  that  contain  the  same  methods  with  different  implementations. 

An  example  of  polymorphism  used  in  this  computer  code  is  the  Parallel  Class.  The  Parallel  Class 
handles  the  work  of  spawning  child  processes,  deciding  which  processes  are  neighbors,  deciding  the 
domain  decomposition,  and  handling  interprocess  communication.  The  implementation  is  a  function  of 
the  message  passing  library  used  (e.g.  PVM,  APPL).  So  a  base  Parallel  Class  can  have  several  sub¬ 
classes,  each  of  which  corresponds  to  a  different  message  passing  library  that  uses  library  specific 
commands.  A  default  Parallel  Class  whose  methods  do  nothing  would  be  used  on  a  serial  machine. 

Algorithmic  polymorphism 

Hybrid  method  computer  codes  are  by  definition  polymorphic.  Each  implementation  is  capable 
of  reaching  the  same  result  (accurate  computation  of  EM  fields)  using  different  algorithms.  For  the 
present  work,  the  hybrid  method  utilizes  Finite  Difference  Time  Domain  (FDTD)  and  Finite  Volume 
Time  Domain  (FVTD).  Both  methods  use  fields  discretized  over  cell  grids.  Both  methods  (in  at  least 
some  formulations)  use  similar  processes  (i.e.  update  E  fields,  step  in  time,  update  H  fields,  step  in 
time,  apply  boundary  conditions,  save  output,  repeat).  FDTD  is  much  more  memory  efficient  since  it  is 
not  necessary  to  describe  each  cell  individually.  FVTD  has  the  advantage  of  being  applicable  to 
unstructured  grids  as  well  as  structured  grids.  Each  algorithm  can  be  represented  by  a  subclass  of  the 
same  base  class  that  would  contain  the  common  methods  (e.g.  update  E  fields,  etc.).  An  application  of  a 
hybrid  FDTD/FVTD  method  is  a  complex  conformal  object  surrounded  by  a  regular  grid  that  contains 
the  outer  boundary  radiating  condition  [4,5].  This  hybrid  method  also  could  be  used  to  solve  a  problem 
entirely  with  FDTD  or  FVTD.  By  standardizing  the  interface  between  regions  where  different  algorithms 
are  used,  it  is  possible  to  have  communication  between  these  regions. 

One  implementation  method  would  divide  the  problem  space  into  subspaces.  Each  subspace  uses 
either  FDTD  or  FVTD  and  is  represented  by  the  appropriate  subclass  object.  The  subspaces  are  collected 
into  an  array  or  list  that  contains  base  class  pointers  to  the  subspace  objects.  The  application  code  then 
traverses  the  list,  invoking  the  methods  for  each  subspace  object  in  turn  without  knowing  how  the  object 
will  accomplish  its  task. 


Data  Parallelism 

Both  FDTD  and  FVTD  lend  themselves  to  data  parallelism  implemented  as  domain 
decomposition.  From  the  viewpoint  of  the  application  programmer,  the  mechanics  of  the  domain 
decomposition  should  be  hidden.  Just  as  each  process  working  on  a  subset  of  the  problem  space  acts  as 
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if  it  is  working  on  the  entire  problem,  the  application  programmer  and  user  need  to  work  on  the  entire 
problem  as  if  it  is  not  broken  up.  In  the  present  case  two  class  hierarchies,  the  Field  Class  and  the  Focus 
Class,  work  together.  The  Field  Class  contains  the  discrete  EM  field  data  for  the  problem  space.  The 
data  set  is  amorphous,  neither  structured  nor  unstructured.  The  Focus  Class  provides  the  framework  for 
describing  how  the  Field  Class  data  should  be  ordered.  An  analogy  is  that  the  Field  Class  is  tent  fabric 
and  the  Focus  Class  is  the  set  of  tent  poles  defining  the  shape  of  the  tent.  The  Focus  Object  knows  what 
subset  of  a  Field  Object  to  iterate  over  and  how  to  iterate  over  that  subset.  Tins  subset  is  ^so  a  function 
of  the  data  decomposition  among  processes.  The  Focus  Object  takes  position  information  listed  in  a 
<^lobal  sense  and  converts  it  to  process  local  information  that  is  used  to  control  iterations  over  the  Field 
Classes  (Fig.  1).  Since  this  algorithm  is  implemented  in  different  ways  for  structured  and  unstructured 
grids,  different  sub-classes  of  the  Focus  Class  will  be  needed  for  these  grids. 


Physical  Continuous  Application 

Reality  EM  Fields 


Discrete 
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Discrete 
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Data  block  Unstructured 
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Figure  1.  Relationship  between  levels  of  abstraction  and  the  OOP  classes. 
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Both  FDTD  and  FVTD  use  nearest  neighbor  information  and  therefore  require  message  passing 
of  EM  field  information  between  processes.  The  Focus  Class  works  in  conjunction  with  the  Parallel 
Class  for  this  task  with  the  Focus  Object  containing  a  Parallel  Object  as  an  attribute.  The  Parallel  Object 
determines  whether  data  must  be  passed.  If  so,  the  Focus  Object  loads  a  message  with  the  appropriate 
data  and  sends  it  using  member  functions  (methods)  of  its  Parallel  Object. 

The  dynamic  memory  allocation  ability  of  C-H-i-  is  very  useful  for  data  parallelism.  Arrays  can  be 
dimensioned  to  the  correct  size  as  a  function  of  the  number  of  processes  used.  The  code  uses  the 
minimum  amount  of  memory  without  recompilation  every  time  the  number  of  processors  is  adjusted. 
This  is  particularly  important  for  people  using  clusters  of  workstations  that  have  load  distributions  that 
vary  from  day  to  day. 


Boundary  Conditions 

Boundary  conditions  often  represent  one  of  the  most  difficult  aspects  of  nunierical  analysis 
programmin'^  Boundary  conditions  often  use  a  small  fraction  of  the  code  s  execution  time  while 
coinprising  a  large  number  of  lines  of  actual  source  code.  The  boundary  condition  is  a  concept  that  can 
be  represented  by  a  class  hierarchy.  All  boundary  conditions  provide  the  same  service;  modification  of 
fields  in  a  subset  of  the  problem  space.  In  addition,  all  boundary  conditions  use  data,  mostly 
multiplicative  constants  required  by  the  boundary  condition.  The  actual  data  needed  and  the 
implementation  used  vary  from  boundary  condition  to  boundary  condition.  Differences  in 
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implementation  can  be  handled  by  creating  different  sub-classes  to  represent  the  different  boundary 
conditions,  such  as  1st  order  Mur  or  2nd  order  Liao. 

One  challenge  of  applying  boundary  conditions  is  that  an  operation  is  applied  only  to  a  portion  of 
the  EM  fields  in  the  problem  space.  Since  the  Focus  Object  handles  iteration  for  the  data  set  in  the  Field 
Object,  the  Focus  Object  must  also  keep  track  of  what  portion  of  the  data  set  is  to  be  iterated  over  (the 
active  region).  This  service  also  is  needed  for  applying  forcing  functions  and  applying  material  attributes 
(Fig.  2).  The  boundary  condition  may  contain  smaller  Field  Objects  that  cover  only  the  boundary  region. 
The  Focus  Object  must  ensure  that  a  mathematical  operation  on  two  Field  Objects  only  occurs  if  the 
objects  have  the  same  size  active  region. 


Typical  subregions  in  a  gridded  problem 

Outer  __  _  Process  p  Feed  region 

boundary  boundary  (forcing  term) 

Figure  2.  There  are  various  subsets  of  the  entire  grid  that  must  be  operated 
on  during  the  course  of  the  numerical  analysis. 


Comparisons  with  a  structured  programming  code 
The  present  C++  code  is  compared  to  FDTD__ANT,  a  non-conformal  structured-grid  FDTD 
program  that  is  based  upon  the  Luebbers  and  Kunz  FDTD  A  code  [6].  Both  codes  use  the  same  FDTD 
formulation  and  may  be  used  with  either  scattered  field  or  total  field  formulation.  These  codes  are 
compared  for  code  development,  code  maintenance,  flexibility,  portability,  memory  efficiency,  and 
speed  efficiency.  A  version  of  the  FDTD_ANT  code  is  parallelized  for  domain  decomposition  along  any 
one  of  the  three  Cartesian  axes.  The  C++  code  may  be  domain  decomposed  along  any  combination  of 
the  three  axes  (including  all  three  at  once). 

Code  Development 

Writing  an  OOP  code  often  involves  more  design  overhead  than  writing  a  similar  code  in  the 
structured  programming  paradigm.  In  Object  Oriented  Design  (OOD)  the  overall  scenario  must  be 
divided  into  individual  concepts  which  are  modeled  by  objects.  The  designer  then  decides  whether  the 
concept  is  better  represented  by  one  object  or  a  hierarchy  of  objects.  In  some  cases,  classes  may  already 
exist  that  have  some  of  the  necessary  capabilities.  Using  existing  classes  as  a  foundation  for  new  classes 
is  a  hallmark  of  code  reuse.  Note  that  the  term  code  reuse  refers  to  using  already  created  and  tested 
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classes  in  different  applications.  Code  reuse  is  different  from  code  copying,  which  is  t^ng  portions  of 
codffrom  one  fde  and  copying  them  into  another  fde.  With  code  reuse  less  total  code  is  created  and 
there  are  fewer  opportunities  for  typographical  errors  than  with  code  copying. 

Code  maintenance 

Code  maintenance  is  a  key  issue  in  life-cycle  costs  since  most  CEM  codes  are  undergoing  almost 
constant  modification  The  OOP  paradigm  specifically  addresses  code  maintenance  in  several  ways. 
Encapsulation  makes  the  code  more  modular,  which  prevents  new  code  from  affecting  unrelated  easting 
code^In  addition  much  less  global  data  (including  common  blocks)  exists,  making  it  less  likely  that 
memory  will  be  overwritten.  The  additional  code  is  usually  simpler  to  write.  For  example,  the  iterator 
Ses  of  The  Field  Object  are  all  encapsulated  within  the  Focus  Object  that  is  itself  encapsulated  within 
the  Field  Object.  Application  code  using  Field  Objects  contain  no  indices  or  do-loops  ;  U®®  °5 
free”  notation  at  the%plication  code  level  makes  the  code  easier  to  read  and  avoids  the  potential  for 
typographical  errors  that  often  accompany  nested  “do-loops”. 

Flexibility 

The  OOP  code  is  far  more  flexible  due  to  polymorphism.  By  using  class  hierarchies  that  contain 
virtual  functions  and  functors  (functions  that  are  really  objects  and  can  be  treated  like  data)  «  mu®h 
easier  to  write  hybrid  method  codes.  In  particular,  it  is  possible  to  write  hybnd  method  codes  that  don  t 
appear  to  use  multiple  methods  or  algorithms.  The  versatility  of  code  can  be  increased  by  adding 
sXlasses  which  need  only  define  the  methods  whose  implementation  has  changed. 

Portability 

One  clear  portability  advantage  that  emerged  involved  parallelization.  The  FORTR^  code  has 
two  versions  a  serial  version  and  a  parallel  version.  To  execute  the  parallel  version  on  a  senal  machine 
would  be  necessary  to  remove  all  the  message  passing  library  calls  since  that  libr^  would  not  be 
rnSled  on  a  se?Sl  machine.  In  the  OOP  paradigm,  each  type  of  computer  would  have  a  library 
containin'^  the  Parallel  subclass  relevant  to  the  computer’s  configuration.  A  send  macbne  would  have  a 
“stub”  subclass  that  would  do  nothing.  The  application  code  which  only  deals  with  the  base  Parallel 
Class  becomes  completely  portable.  This  portability  could  just  as  easily  be  applied  to  ote  classes,  with 
Shi  ophSTgorithm^  kernels  bein|  stored  in  Class  libraries  that  vary  from  machine  to  machine^ 
AreTation  codes  would  invoke  the  methods  in  these  libraries  without  having  to  know  the 
implementiation  details. 

Memory  efficiency 

FORTRAN  tends  to  be  very  memory  inefficient  due  to  the  lack  of  dynamic  memory  allocation. 
This  is  a  problem  for  parallel  codes  where  the  data  set  size  is  a  function  of  the  number  of  Processes 
runnin'J  The  C  language  has  dynamic  memory  allocation  through  the  malloc  command,  but 
additional  command^must  be  used  in  C  to  deallocate  memory.  The  C++  language  goes  a  step  further 
with  automatically  called  destructors  that  take  care  of  memory  deallocation. 

Speed  efficiency 

While  the  project  has  not  concentrated  on  speed  of  execution,  it  is  ®®rt^nlyM  important  issue^ 
At  the  nresent  time  the  OOP  code  runs  significantly  slower  than  the  structured  FORTRAN  codeia 
fector  (rf  8  5.  In  reference  7,  the  authors  compared  versions  of  a  pmputational  Fluid  Dyna^cs  (C^ 
code  written  in  FORTRAN,  C,  and  C++  (the  first  two  codes  used  structured  programming  and  the  C++ 
code  Ise^OOP)  On  Variety  of  machines  the  C  code  was  faster  than  the  C++  code  by  a  factor  of 
rou°hIv  2  (the  FORTRAN  code  was  roughly  equivalent  to  the  C  code  m  speed)  For  many  nuinerical 
appS^s  Ais  would acceptable  iSssl  execution  efficiency.  The  work  in  [7]  suggests  that  many 
things  could  be  done  to  make  the  present  OOP  code  faster. 

^^osvilts 

To  date,  an  OOP  code  has  been  written  in  C++.  This  code  is  fully  parallelized  (3 -dimensional 
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domain  decomposition)  using  the  PVM  message  passing  library.  It  uses  the  FDTD  algorithm  on  a 
structured  grid  with  either  PML  or  1st  order  Mur  boundary  conditions.  Agreement  with  the  FDTD_ANT 
code  is  good  for  a  sample  problem  (agreement  is  not  exact  since  the  codes  use  different  boundary 
conditions).  At  present  the  unstructured  grid  classes  and  FVTD  solver  are  being  added  to  the  code  and 
will  be  further  reported  on  during  the  presentation. 
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Introduction 


It  has  been  recently  shown  that  the  use  of  wavelet-like  basis  functions  in  the 
finite  element  solution  of  elliptic  problems  yields  a  global  matrix  that,  after  simple 
diagonal  preconditioning,  has  a  condition  number  that  is  bounded  by  a  constant  as 
the  number  of  points  of  discretization  is  increased  [1].  This  contrasts  sharply  with 
the  situation  that  arises  when  the  traditional  finite  element  or  finite  difference 
method  is  used;  in  either  of  these  cases  the  usual  preconditioning  methods  make 


the  condition  number  01  -  ;  and  if  simple  diagonal  preconditioning  is  used,  the 

\hj 


condition  number  will  go  to  infinity  as,  at  best,  ^  as  the  number  of  points  of 

discretization  is  increased,  i.e.,  as  h  goes  to  zero.  This  is  important  because  when 
iterative  algorithms  are  employed,  ill  conditioning  can  lead  to  numerical 
instabilities,  slow  convergence,  or  even  lack  of  convergence.  Thus,  the  use  of 
wavelet-like  basis  functions  appears  to  have  significant  potential  as  an  efficient 
method  for  solving  elliptic  problems  over  large  and  highly  heterogeneous  problem 
domains. 

A  wavelet  is  a  function  that  is  localized  in  both  the  spatial  and  frequency 
domains  and  is  such  that  any  function  can  be  expressed  in  terms  of  translations 
and  dilations  (or  compressions)  of  the  wavelet.  Tliis  is  a  relatively  new  area  of 
research  in  computational  electromagnetics.  Steinberg  and  Leviatan  discuss  the 
use  of  wavelet  expansions  in  an  integral  equation  technique  in  [2].  In  [1]  and  [3], 
Jaffard  and  Laurencot  present  wavelet  mediods  for  elliptic  problems.  In  the  1994 
IEEE  AP-S  Symposium  and  URSI  Meeting,  there  were  several  presentations 
related  to  wavelets,  including  [4] -[7]. 
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Construction  of  the  Basis  of  Wavelet-like  Functions 


The  technique  that  is  employed  in  this  paper  is  explained  in  detail  in  [1];  its 
application  in  this  paper  can  be  summarized  as  follows.  An  initial,  coarse 
discretization  of  the  problem  domain  is  chosen.  The  traditional  finite  element  basis 
functions  associated  with  the  nodes  in  this  initial  discretization  are 
orthonormalized  with  respect  to  each  other  using  the  procedure  described  in  [1]. 
The  resulting,  orthonormalized  functions  are  the  first  functions  in  the  basis  of 
wavelet-like  functions.  If  finer  discretization  is  desired,  new  nodes  are  introduced 
at  the  midpoints  of  the  intervals  between  the  existing  nodes.  The  traditional  finite 
element  basis  functions  associated  with  these  new  nodes  are  then  orthogonalized 
against  the  functions  in  the  basis.  Then,  the  resulting  functions  are 
orthonormalized  with  respect  to  each  other.  The  functions  that  result  from  this  are 
then  added  to  the  basis.  This  procedure  is  repeated,  with  more  nodes  being 
introduced  into  the  mesh  and  new  functions  being  added  to  the  basis  at  each  step, 
until  the  discretization  is  as  fine  as  desired.  Then,  the  standard  finite  element 
method  is  employed  using  the  wavelet-like  functions  constructed  in  this  procedure. 

As  an  illustration  of  these  ideas,  consider  the  finite  element  solution  of  the 
problem: 

-V  •  {aVu)  +  bu  =  g  on  Q. 
u  =  h  on 
a(x)>C>0 
b(x)>0 

For  the  sake  of  simplicity,  consider  a  one  dimensional  problem  in  which  Q  is  the 
line  segment  from  x=0  to  x-3  and: 

a{x)  =  l.l  +  sin(25j:) 
b(x)  =  lO 
g(x)  =  \0,0x 
m(0)  =  2.0 
w(3)  =  4.0 

When  the  technique  described  in  [2]  is  used  to  solve  this  problem,  the  first  few 
wavelet-like  basis  functions  are  as  shown  in  figures  1-5.  (Figure  5  is  presented 
simply  to  provide  an  uncluttered  view  of  one  of  the  wavelet-like  basis  functions 
shown  in  figure  4.)  Figure  1  shows  the  first  function  added  to  the  basis,  the  initial, 
coarse  discretization  used  in  this  case  contains  only  one  interior  node  which  is 
located  at  the  midpoint  of  the  interval.  In  the  next  step,  one  node  is  placed  at  the 
midpoint  of  each  of  the  two  sub-intervals  of  the  initial  discretization.  The 
traditional  finite  element  basis  function  associated  with  each  of  these  nodes  is  then 
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orthogonalized  against  the  function  shown  in  Figure  L  Then,  the 
orthonormalization  procedure  is  applied  to  the  two  resulting  functions  to  yield  the 
final  wavelet-like  functions  shown  in  Figure  2.  These  functions  are  orthogonal  to 
each  other  and  to  the  function  shown  in  Figure  1  as  well.  At  this  point  there  are 
three  interior  nodes  in  the  discretization  and  three  wavelet-like  functions  in  the 
basis.  At  the  next  step,  nodes  are  placed  at  the  midpoints  of  each  of  the  current 
four  sub-intervals.  The  above  procedure  is  repeated  to  yield  the  four  wavelet-like 
functions  shown  in  Figure  3.  The  next  stage  in  the  procedure  yields  the  eight 
functions  presented  in  Figure  4.  It  should  be  noted  that  the  fifteen  wavelet-like 
basis  functions  shown  in  Figures  1-4  span  the  same  space  as  do  the  fifteen 
traditional  finite  element  basis  functions  described  by: 


8i(x)  = 


0 

x-(i-l)A 

A 

(i  +  l)A-x 

A 

0 


forx<(i-l)A 
for  (i  -  1)A  <  X  <  iA 

for  iA  <  X  <  (i  +  1)A 
forx>(i  +  l)A 


fori  =  l,  2,  15 


3 

where  A  =  — .  This  procedure  is  continued  until  the  discretization  is  as  fine  as 
16 

desired. 


Numerical  Results 

The  advantage  of  using  the  wavelet-like  basis  functions  rather  than  the 
traditional  finite  element  basis  functions  is  demonstrated  in  figures  6-9.  The 
problem  above  was  solved  using  3,  7,  15,  31,  63,  127,  255,  511,  1023,  and  2047 
wavelet-like  basis  functions  and  3,  7,  15,  31,  63,  127,  255,  511,  1023,  and  2047 
traditional  basis  functions.  In  figure  6,  the  condition  number  of  the  preconditioned 
global  matrix  is  shown  for  these  various  numbers  of  basis  functions  for  both 
wavelet-like  and  traditional  finite  element  basis  functions.  (When  wavelet-like 
basis  functions  were  employed,  simple  diagonal  preconditioning  was  used;  when 
the  traditional  finite  element  basis  functions  were  employed,  incomplete  Cholesky 
preconditioning  was  used.)  As  can  be  seen  in  this  figure,  the  effect  on  the 
condition  number  of  using  wavelet-like  rather  than  the  traditional  basis  functions  is 
dramatic;  while  the  condition  number  climbs  to  more  than  2.5  million  when  the 
traditional  basis  functions  are  used,  it  reaches  a  peak  value  of  65.48  and  then 
actually  declines  as  the  number  of  basis  functions  increases,  when  the  wavelet-like 
basis  functions  are  used.  In  fact,  the  condition  number  for  the  wavelet  cases  is  so 
much  smaller  than  that  for  the  traditional  basis  functions  that  the  details  of  the 
behavior  cannot  be  seen  in  figure  6;  thus  figure  7  is  presented  to  provide  a  more 
detailed  picture  of  the  behavior  of  the  condition  number  when  the  wavelet-like 
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functions  are  used.  Clearly,  even  though  a  much  simpler  preconditioiung  is  being 
used  in  the  wavelet  than  in  the  traditional  cases,  the  global  matrix  arising  from  the 
use  of  wavelet-like  functions  is  much  better  conditioned  than  that  arising  from  the 
use  of  the  traditional  basis  functions.  The  importance  of  this  is  demonstrated  in 
figures  8  and  9.  In  these  figures  we  see  plots  of  the  number  of  steps  required  for 
the  convergence  of  the  conjugate  gradient  algorithm  for  each  of  the  cases  presented 
in  figures  6  and  7.  (The  conjugate  gradient  algorithm  was  applied  using  the 
preconditioned  global  matrix  for  which  the  condition  number  is  plotted  in  figiures  6 
and  7.)  In  figure  6  it  is  seen  that  when  the  traditional  basis  functions  are  used,  the 
number  of  steps  required  for  convergence  increases  without  bound  as  the  number 
of  basis  functions  is  increased;  but  when  the  wavelet-like  basis  functions  are 
employed,  the  number  of  steps  remains  quite  low.  In  fact,  as  was  Ae  case  in  figure 
6,  the  performance  obtained  using  the  wavelet-like  basis  functions  is  so  much 
bkter  than  that  obtained  using  the  traditional  basis  functions  that  it  is  impossible  to 
depict  both  in  detail  on  the  same  scale;  so,  figure  9  is  presented  to  provide  a  more 
detailed  picture  of  the  variation  of  the  number  of  steps  required  when  the  wavelet¬ 
like  functions  are  used.  From  this  picture  we  can  see  that  the  number  of  steps 
required  for  a  matrix  of  order  2,047  is  the  same  as  that  required  by  a  matrix  of 
order  511  and  is  only  11.4%  greater  than  the  number  of  steps  required  for  a  matnx 
of  order  127. 


Conclusions 

In  this  paper,  the  use  of  wavelet-like  basis  functions  in  the  solution  of 
elliptic  problems  over  one-dimensional  domains  is  discussed.  In  this  case,  Ae  use 
of  such  basis  functions  yields  matrices  that,  with  simple  diagonal  preconditioning, 
are  much  better  conditioned  than  are  those  resulting  from  the  use  of  traditional 
finite  element  basis  functions  and  more  sophisticated  preconditioning  schemes. 
This  leads  to  much  more  rapid  convergence  of  the  conjugate  gradient  algorithm 
when  these  wavelet-like  basis  functions  are  employed. 
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Figure  1.  Wavelet-like  basis  function  #1 
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Figure  4.  Wavelet-like  basis  functions  #8-#15 
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Condition  Number 


Figure  6.  Condition  number  as  a  function  of  the  number  of  basis  functions 
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Figure  7.  Condition  number  as  a  function  of  the  number  of  basis  functions 
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Figure  9.  Number  of  steps  required  for  convergence  as  a  function  of  the  number  of 
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1.  Introduction 

In  the  study  of  complex,  large-scale  electromagnetic  problems,  two  factors  are  of  key  importance: 
(1)  a  powerful  computer  with  sufficient  speed  and  memory  space,  and  (2)  a  rigorous  numerical 
technique  characterized  by  a  high  degree  of  accuracy  and  efficiency.  With  the  rapid  advances  in  the 
high  speed  computer  technology,  the  numerical  analysts  are  being  provided  with  ever-increasing 
computing  power.  In  the  meanwhile,  the  numerical  techniques  for  electromagnetics  have  reached 
such  a  level  of  maturity  that  the  full-wave  analysis  of  many  complex  structures  is  no  longer  an 
unrealistic  computational  goal.  As  the  complexity  of  the  geometries  under  study  increase,  the 
speed  factor  becomes  more  prominent  due  to  the  enormous  volume  of  the  computational  load.  It 
is  essential  to  complement  the  speed  of  the  computing  machinery  by  an  increase  in  the  efficiency  of 
the  numerical  codes.  This  prompts  an  urgent  need  for  the  development  of  fast  algorithms  for  the 
numerical  solution  of  electromagnetic  problems. 

The  Method  of  Moments  (MoM)  has  long  been  used  as  one  of  the  primary  tools  for  the  rigor¬ 
ous  study  of  electromagnetic  problems  [1].  This  method  in  conjunction  with  the  integral  equation 
technique  provides  the  most  accurate  and  in  many  cases  an  exact  formulation  of  the  related  the 
boundary  value  problem.  In  spite  of  aU  its  advantages  over  the  other  numerically  intensive  tech¬ 
niques,  the  major  challenge  in  the  method  of  moments  is  the  choice  of  appropriate  basis  functions 
that  would  provide  the  required  accuracy  and  computational  efficiency.  An  impeding  characteristic 
of  this  approach  is  that  the  resulting  matrices  are  fuU  and  have  very  poor  condition  numbers.  This 
challenge  becomes  even  bigger  as  the  geometry  under  study  becomes  non- canonical  and  complex, 
involving  larger  numbers  of  unknowns. 

The  past  two  years  have  witnessed  the  successful  application  of  the  newly  developed  theory 
of  orthonormal  wavelets  to  various  electromagnetic  problems  [2]- [4].  It  has  been  established  that 
the  use  of  wavelet  expansions  in  conjunction  with  the  method  of  moments  leads  to  highly  sparse 
moment  matrices.  This  property  is  based  on  the  fact  that  the  condition  number  of  the  resulting 
matrices  is  so  good  that  performing  a  thresholding  procedure  by  discarding  a  large  number  of  the 
matrix  entries  which  faU  below  a  certain  threshold  level  does  not  degrade  the  accuracy  of  results. 
Such  a  sparsity  makes  it  possible  to  employ  a  wide  range  of  numerical  methods  especially  developed 
for  the  fast  and  efficient  storage  and  inversion  of  this  type  of  matrices. 

The  orthonormal  wavelets  have  some  other  interesting  properties  w'hich  can  be  properly  ex¬ 
ploited  to  speed  up  the  moment  method  computations  significantly.  One  such  feature  is  the  Fast 
mvelet  Algorithm  (FWA)  which  has  been  used  extensively  in  signal  processing  appHcations  [5].  In 
brief,  this  property  enables  one  to  compute  the  wavelet  expansion  coefficients  at  a  certain  resolution 
from  those  of  the  next  higher  resolution.  Thus,  in  a  moment  method  solution,  it  is  possible  to  con¬ 
struct  a  recursive  algorithm  to  compute  all  the  moment  matrix  elements  in  consecutive  resolution 
levels  from  those  of  the  highest  resolution.  In  the  following,  first  we  present  a  short  account  of  the 
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theory  of  one- dimensional  multiresolution  analysis  (MRA)  to  set  up  the  definitions  and  conven¬ 
tions.  Then,  the  fast  wavelet  algorithm  (FWA)  is  described  in  its  one-dimensional  version.  This  is 
followed  by  the  construction  of  two-dimensional  multi  re  solution  analysis  and  the  associated  FWA 
algorithm.  Finally,  the  implementation  of  these  algorithms  for  MoM  speed-up  are  discussed  and 
some  numerical  results  are  presented. 


2.  One-Dimensional  Multiresolution  Analysis 

A  multiresolution  analysis  (MRA)  of  L'^{R)  is  a  sequence  of  nested  approximation  subspaces 
of  denoted  by  such  that  all  subspaces  are  scaled  versions  of  a  central  space 

Vo  obtained  through  dilation  by  integral  powers  of  2  [6].  Each  subspace  Kn,  which  is  said  to 
have  a  resolution  of  2“”^,  possesses  details  twice  as  fine  as  those  of  its  predecessor,  VJn-i,  on 
this  approximation  ladder.  Each  subspace  Vm  by  definition  has  an  orthonormal  basis,  denoted 
by  {<l>m,n}neZ^  which  consists  of  properly  scaled  and  shifted  versions  of  a  function  <p{x),  called 
the  scaling  function,  such  that  -  n).  Given  an  arbitrary  square-integrable 

function  /(a:),  one  can  then  express  its  approximation  at  a  resolution  of  2~^  by  defining  a  projection 
operator  Pmif)  onto  the  subspace  in  the  following  form: 

Xm(/)  =  (l) 

neZ 

where  <  f,g  >  denotes  the  inner  product  of  the  two  functions  f{x)  and  g{x).  We  denote  the 
orthogonal  complement  of  Vm  in  Vm+i  by  Wm  and  write  Vm  0  Wm  =  It  can  be  shown 

that  for  each  complement  subspace  Wm,  there  exists  an  orthonormal  basis,  denoted  by  {'il^m,n}nez, 
which  consists  of  scaled  and  shifted  versions  of  another  function  'tp(x),  called  the  mother  wavelet, 
such  that  '0m,n(2:)  =  2’^/^^(2”^a:  -  n).  Then,  one  can  write; 

Pm  +  l{f)  -  Pm{f)-\-'^  <  f,  fm,n  >  (2) 

n^Z 

It  can  be  shown  that  the  entire  set  {'tpm,n}m,n^z  constitutes  a  complete  orthonormal  basis  for  X^(iE). 
Now,  having  a  crude  approximation  of  the  function  f{x)  at  a  coarse  resolution  level,  say  mi,  one 
can  improve  it  to  a  finer  resolution  level,  say  m2,  by  exploiting  the  wavelets  of  the  intermediate 
levels  in  the  following  manner: 


7712  —  1 

■^77T2(/)  “  "I"  'y  ^  ^  ^  f,'^Tn.,n  ^  '^2  ^  (3) 

m=mi  neZ 

The  definition  of  a  multiresolution  analysis  requires  that  'tp{x)dx  =  0,  implying  that  a 
wavelet  is  a  bandpass  function  with  no  dc  component.  The  scaling  function,  by  contrast,  is  a 
lowpass  function  with  J^^<t>{x)dx  =  1.  Hence,  a  multiresolution  expansion  of  a  square-integrable 
function  f{x)  with  an  arbitrary  spectral  content  can  be  expressed  in  the  following  form: 

oo 

fi^)  =  E  E  E  (4) 

keZ  771=7770  neZ 

where  mo  is  an  initial  resolution  level,  and  Cm,n  S'lid  dm,n  are  the  respective  expansion  coefficients 
given  by 
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Cm,n  —  f->4^Tn,n  > 


(5) 


Various  multiresolution  analyses  with  different  mathematical  properties  have  been  introduced  in 
recent  years.  The  Battle- Lemarie  MRA,  which  is  constructed  based  on  polynomial  spline  functions, 
has  been  efficiently  used  for  the  MoM  analysis  of  electromagnetic  problems  [2]- [4].  Fig.l  shows  the 
plots  of  the  cubic  spline  Battle- Lemarie  scaling  function  and  mother  wavelet. 


3.  The  Fast  Wavelet  Algorithm 

The  foundation  of  the  fast  wavelet  algorithm  is  based  on  the  following  two-scale  property  of  the 
multiresolution  analysis: 

=  V2 

neZ 

(6) 

^(x)  =  <5^(22:  -  n) 

neZ 

where  {hn}nez  is  a  zero-centered  symmetric  discrete  sequence  with  fast  decay,  and 

gn  =  ('i') 

Table  1  provides  the  values  of  the  first  few  positive  /in  coefficients  for  a  cubic  spline  Battle-Lemarie 
multiresolution  analysis. 

Table  II 

hn  Coefficients  for  the  Cubic  Spline  Battle-Lemarie  MRA 


n 

/in 

0 

0.76613005 

1 

0.43392265 

2 

-0.05020169 

3 

-0.11003702 

4 

0.03208086 

5 

0.04206833 

6 

-0.01717627 

7 

-0.01798229 

8 

0.00868520 

In  view  of  properties  (6),  it  is  not  difficult  to  derive  the  two  following  relations: 

<  ft4>m,n  >  ==  2Z  ^k-2n  ^  /•>  4*771+1, k  > 

fcS  Z 


<  f,4^m,n  >  =  Y^9k-2n  <  > 

kez 
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where  *  denotes  the  complex  conjugate.  Using  definitions  (5),  the  above  relations  can  be  written 
in  the  following  manner: 


c 


7n,n 


Xy  ^k-2n 

kez 


y  >  9k— 2n  ^m+lyk 

kez 


(9) 


Equations  (9)  are  the  mathematical  statement  of  the  fast  wavelet  algorithm.  Note  that  accord¬ 
ing  to  these  equations,  the  scaHng  and  wavelet  coefficients  at  each  resolution  can  be  computed  by 
a  discrete  convolution  from  a  knowledge  of  only  scaling  coefficients  at  the  next  higher  resolution. 
Fig.3  shows  the  signal  flow  diagram  for  the  fast  wavelet  algorithm,  where  the  filters  H  and  G 
involve  a  discrete  convolution  plus  a  decimation  by  2.  When  several  resolution  levels  are  required, 
it  is  therefore  sufficient  to  compute  the  scaling  coefficients  only  at  the  resolution  level  immediately 
following  the  highest  resolution  involved.  AU  the  expansion  coefficients  at  the  lower  resolution 
levels  can  be  evaluated  recursively  using  equations  (9). 


4.  Two-Dimensional  Multiresolution  Analysis 

A  two-dimensional  multiresolution  analysis  is  a  ladder  of  successive  nested  approximation  sub¬ 
spaces  Vmoi  Each  subspace  Vm  at  a  resolution  of  is  constructed  from  the  Cartesian 

product  of  two  one-dimensional  MRAs  at  the  same  resolution,  i.e.,  [6].  An  orthonor¬ 

mal  basis  for  Vm  would  consist  of  the  functions  ^rr,,n^,ny{x,y)  -  4>m,nA3:)4>m,ny{y),n^,ny  €  Z, 
where  the  one-dimensional  functions  in  x  and  y  are  properly  dilated  and  shifted  versions  of  the 
scaling  functions  ^(x)  or  <j){y),  respectively,  as  described  in  section  2.  The  orthogonal  complement 
oiVm  in  Vm+i  is  denoted  by  Wm^  and  it  can  be  shown  that 

Wm=(Wm^Vm)  0  {Vm  ®  Wm)  0  {Wm  ®  Wm)  (10) 

where  Wm  is  the  one- dimensional  complement  space  of  Vm-  Hence,  the  orthonormal  basis  for 
consists  of  three  sets  of  wavelets:  y)  =  y)  = 

and  (x, y)  =  with  '4!(x)  and  ip{y)  being  the  mother 

wavelet.  These  wavelets  are  called  vertical,  horizontal  and  diagonal,  respectively,  in  that  they 
sample  variations  of  the  expanded  function  primarily  along  the  corresponding  directions. 

Now,  one  can  approximate  a  doubly  square-integrable  function  f{x,y)  at  a  resolution  of  2~^ 
by  defining  the  following  projection  operator  onto  the  subspace  Vm : 

^  ^  ^  ^  ^m,na:,ny  2/)?  (H) 

Tlx&Z  Tly^Z 

with  the  improved  approximation  at  the  next  finer  resolution,  given  by 

P.n+l(/)  =  P^if)  +  E  E  E  <  (12) 

nx^Z  Uy^Z 

The  improvement  can  be  continued  to  any  arbitrary  resolution  by  including  the  intermediate  2-D 
wavelets. 
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The  two-dimensional  counterpart  of  equation  (4)  for  the  multiresolution  expansion  of  a  doubly 
square-integrable  function  f{x,y)  is  expressed  in  the  following  way: 

00 

Cm.o,kx,ky^mo,kx,ky{x,y)  X^  X!]  X^  ^m,nx,ny^m,nx,ny{^iy) 

kx^ZkyeZ  m=mo  i=v,k,dnseZ  Tiy^Z 

where  mo  is  again  the  initial  resolution  level,  and  Cm,nT,ny  and  are  the  respective  expan¬ 

sion  coefficients  given  by 


^TTl,nx,ny  ^  f  ^  ^T!l,TlxiT^y  ^ 


=  <  f,  ,i  =  v,h,d 

■^m,nx,ny  ^  ’  m,nx,ny  ’  ’  ’  ’ 


Then,  the  two-dimensional  fast  wavelet  algorithm  takes  the  following  form: 


Cm,nx,ny  ”  X^  X^  ^kx-2nx^ky-2ny  Cm+\,kx,ky 
kxeZ  kyEZ 

^m,nx,ny  ~  X^  XZ  S kx-2nx^ky-2ny  ^Tn+l,kx,ky 
kx&Zky^Z 

(15) 

^m,nx,ny  ~  XZ  X!]  ^kx -271x3 ky-2ny  Cm+\,kx,ky 
kx^Zky&Z 

^m,nx,ny  ”  X^  X]/  9kx-2nx9ky-2ny  ^m^-\,kx,ky 
kx&Z  ky 

Fig.4  shows  the  signal  flow  diagram  for  the  two-dimensional  fast  wavelet  algorithm.  Here  again, 
when  different  resolution  levels  are  required,  we  only  need  to  compute  the  scaling  coefficients  at  the 
resolution  level  immediately  following  the  highest  resolution  involved.  All  the  expansion  coefficients 
at  the  lower  resolution  levels  can  be  evaluated  recursively  using  equations  (15). 

5.  Numerical  Results  and  Conclusion 

In  the  implementation  of  the  fast  wavelet  algorithm  in  the  method  of  moments,  it  is  important 
to  note  that  the  moment  integrals  involve  twice  as  many  variables  as  the  dimension  of  the  integral 
equation,  counting  for  the  source  and  observation  variables.  In  other  words,  for  a  one- dimensional 
boundary  value  problem,  the  integration  is  performed  with  respect  to  two  variables  x  and  x  . 
Similarly,  in  a  two-dimensional  problem,  the  moment  integrals  involve  four  variables  x,y,x'  and  y'. 
For  instance,  a  typical  two-dimensional  moment  integral  has  the  following  form: 

kij  =  J  Fi{x,  y)G{x,  y  I  x',  y')  Fj{x',  y')dxdydx'dy’  (16) 

where  &{x,y  \  x‘ ,y^)  is  the  dyadic  Green’s  function  of  the  problem,  and  Fi{x,y)  and  Fj{x\y^)  are 
the  test  and  expansion  functions,  respectively.  Equation  (16)  in  fact  involves  a  shorthand  notation 
where  the  pair  of  indices  (i,  j)  have  been  used  to  replace  a  complex  combination  of  dilation  and 
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shift  indices,  which  would  otherwise  have  been  written  in  the  following  manner: 


ki^  =  k 


fsiiyfyi  |/ij 

»^xi  l^j 


yj 


(17) 


Here,  the  superscripts  fxi^  fyi,  fxj,  fyj^  denote  the  generating  functions  for  the  respective  variables 
and  are  either  (j>  (scaling  function)  or  ^tp  (wavelet).  The  fast  wavelet  algorithm  for  this  problem  can 
now  be  written  in  the  following  way: 


*«  =  E  E  E  E  (18) 

k3:i€Z  kyiEZ  k:cj^Z  kyj^Z 

'Vf** 

ikxi-2nxi  'kyi-2ny,  ikxj-2nxj  ^  kyj-2nyj-^^  mi-\-l,kx,i,kyi\mj^l^k^j 


where  either  hk  or  gk  depending  on  the  type  of  the  respective  generating  func¬ 

tions.  Equation  (18)  implies  that  once  we  have  the  interactions  between  all  the  2-D  scaling  functions 
at  the  resolution  level  immediately  following  the  highest  resolution  involved,  then  all  the  moment 
integrals  at  different  resolution  levels  including  inter-level  interactions  can  easily  be  computed  by 
simple  discrete  convolutions.  Note  that  the  quadruple  sum  in  this  equation  involves  the  product  of 
four  hn  or  gn  coefficients,  which  becomes  extremely  small  for  the  majority  of  index  combinations. 
Moreover,  the  shift  indices  usually  vary  over  a  finite  range  because  the  electromagnetic  quantities 
like  fields  and  current  are  defined  over  finite  regions. 

The  fast  wavelet  algorithm  has  been  applied  to  a  variety  of  2-D  and  3-D  electromagnetic  prob¬ 
lems.  In  this  paper,  we  consider  the  full-wave  analysis  of  dielectric  structures.  A  rectangular 
dielectric  slab  waveguide  of  dimensions  a  x  b  and  a  rectangular  dielectric  resonator  of  dimensions 
a  X  b  X  both  embedded  in  free  space,  have  been  studied.  In  the  slab  case,  the  2-D  volume 
polarization  current  is  expanded  in  a  2-D  basis  consisting  of  a  1-D  multiresolution  expansion  in  the 
transverse  direction  and  a  sub-domain  pulse  expansion  along  the  normal  direction.  In  the  resonator 
case,  the  3-D  volume  polarization  current  is  expanded  in  a  3-D  basis  consisting  of  a  2-D  multires¬ 
olution  expansion  in  the  transverse  plane  and  a  sub- domain  pulse  expansion  along  the  normal 
direction.  It  has  already  been  demonstrated  that  in  both  cases,  the  resulting  moment  matrices  are 
highly  sparse,  and  using  iterative  techniques  such  as  the  bi- conjugate  gradient  method  can  improve 
the  computation  speed  drastically. 

In  the  present  study,  we  have  implemented  the  fast  wavelet  algorithm  for  the  slab  and  resonator 
problems.  Fig. 4  shows  the  variation  of  the  normalized  propagation  constant  of  the  dominant  mode 
of  a  rectangular  dielectric  slab  with  a  =  26,  and  Cr  =  2.25  versus  a  normalized  frequency  defined 
as  y  =  (26/Ao)(er  —  1)^/^.  The  results  have  been  compared  to  those  of  a  conventional  moment 
method  with  a  2-D  pulse  basis  [7].  In  this  case,  each  application  of  FWA  algorithm  plus  the  Bi-CG 
method  for  the  solution  of  the  resulting  sparse  system  takes  a  fraction  of  a  second  on  a  SUNSPARK 
20  workstation.  The  results  for  rectangular  dielectric  resonators  have  been  presented  in  [4],  where 
a  comparison  with  other  approximate  methods  have  been  made.  For  example,  for  a  dielectric 
resonator  of  dimensions  a  =  lOmm,  6  =  8mm,  h  =  5mm,  and  =  20,  the  technique  described 
in  this  paper  using  the  FWA  yields  a  resonant  frequency  of  5.b32GHz  for  the  resonator  dominant 
mode.  The  computed  resonant  frequency  based  on  the  effective  dielectric  constant  (EDC)  method 
is  5.65lGHz  [8].  In  this  3-D  volume  integral  equation  problem,  each  application  of  FWA  algorithm 
plus  the  Bi-CG  method  takes  typically  four  to  six  minutes  on  a  SUNSPARK  20  workstation, 
depending  on  the  number  of  the  sub-domain  pulses  involved. 

Finally,  it  is  important  to  note  that  the  nature  of  the  fast  wavelet  algorithm,  in  its  4-D  or 
higher- dimensional  versions  for  the  computation  of  intra-level  and  inter-level  MoM  integrals,  easily 
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renders  itself  to  a  code  parallelization.  Although  this  process  has  not  yet  been  carried  out  by  the 
present  authors,  some  preliminary  code  optimizations  indicate  a  promising  improvement  in  the 
efficiency  and  speed  especially  for  three-dimensional  electromagnetic  problems. 
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Fig.l.  Cubic  spline  Battle- Lemarie  scaling  function  and  mother  wavelet. 
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1.  Introduction 

In  many  practical  high  frequency  scattering  problems,  the  scatterers  are 
very  large  compared  to  wavelength.  When  high  frequency  RCS  is  required, 
exact  solution  is  impossible  with  current  computer  resources.  In  this  case, 
approximate  methods  are  very  helpful.  Typical  approximate  techniques  are 
geometric  optics  (GO),  physical  optics  (PO),  geometric  theory  of  diffraction 
(GTD),  and  physical  theory  of  diffraction  (PTD).  These  approximations  have 
been  incorporated  into  the  shooting  and  bouncing  ray  (SBR)  method  [1]. 

In  this  paper,  we  develop  another  RCS  estimation  technique  called  the 
fast  far  field  approximation  (FAFFA).  The  idea  of  this  method  is  to  compute 
the  interactions  betw'een  elements  using  different  methods  depending  on  the 
electrical  distance  between  the  elements.  When  the  distance  is  small,  exact 
interaction  is  calculated;  -when  the  distance  is  large,  a  far  field  approxima¬ 
tion  is  used.  The  field  interactions  are  calculated  iteratively  using  conjugate 
gradient  (CG)  method.  The  iteration  process  is  equivalent  to  taking  into 
account  the  higher  order  terms  in  GO,  PO  or  PTD  approximations.  After 
several  iterations,  we  find  the  results  to  be  greatly  improved  over  that  of  a 
simple  PO  or  GO.  On  the  other  hand,  this  method  can  deal  with  small  scale 
target  geometry  variations  since  it  computes  the  near  field  interaction  exactly 
hence  it  is  more  flexible  to  be  applied  to  arbitrary  complex  targets.  This  algo¬ 
rithm  has  a  computational  complexity  of  and  memory  requirement 

of  0(iV)  for  N  unknown  problems. 

In  the  following,  we  first  discuss  the  algorithm  mathematically,  and  then 
give  numerical  results  to  validate  the  algorithm  and  show  its  applications 
to  large  size  objects.  The  formulation  is  presented  for  electric  field  integral 
equation  (EFIE)  for  TM  polarization.  The  numerical  simulation  contains 
both  2D  and  3D  results.  In  this  paper,  the  time  factor  of  is  assumed 
and  suppressed. 


t  This  work  was  supported  by  the  Office  of  Naval  Research  under  grant  N00014-89-J1286, 
the  National  Science  Foundation  under  grant  NSF  ECS  92-24466,  the  Array  Research 
Office  under  contract  DAAL03-91-G-0339  and  NASA  under  contract  NAG  2-871.  The 
computer  time  was  provided  by  the  National  Center  for  Supercomputing  Applications 
(NCSA)  at  the  University  of  Illinois,  Urbana-Champaign. 
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2.  Formulation 

Consider  a  TM  (to  z)  plane  wave  incident  on  a  two  dimensional  conduct¬ 
ing  cylinder  in  free  space.  The  integral  equation  satisfied  by  the  induced 
current  J  can  be  written  as, 

^  dr'J(r')Jl(^>(klr  -  r'|)  =  B--(r),  (1) 

where  is  a  0-th  order  Hankel  function  of  first  kind.  The  integration 

is  over  the  contour  of  the  cylinder.  =  Boexp(—ik^  •  r).  The  subscript 

z  for  J  and  is  impUed  here.  Moreover,  r  —  xcc  +  is  a  2D  vector, 
=  fc(:rcos^*  +  ysin0^),  where  9"  is  the  incident  angle,  and  fc  =  27r/A  is  the 
free  space  wave  number. 

To  discretize  Equation  (1),  we  first  divide  the  surface  into  strips,  and 
assume  that  the  induced  current  on  each  strip  is  a  constant  Jj.  Thus  the 
discretized  form  of  Equation  (1),  according  to  the  method  of  moments,  is  [2], 

N 

J29ijJj  =  -Bi/(307rk),  i  =  ,JV,  (2) 

J  =  1 

where, 

^  I  i^j, 

I  Aj  [l  +  zf  hi(0.163805fcAi)]  ,  i  ^  j, 

Aj  is  the  strip  width,  Ei  =  E^^^{Ti)  and  rji  —  jr^  —  rj|. 

Equation  (2)  can  be  solved  using  the  conjugate  gradient  (CG)  method 
[3].  The  most  costly  part  of  a  CG  iteration  is  the  performance  of  the  matrix- 
vector  multiply,  i.e.,  to  compute  the  summation  on  the  left  hand  side  of 
Equation  (2)  for  i  =  1,2,  •  •  ■  ,iV.  Hence,  the  operation  cormt  of  a  matrix- 
vector  multiplication  is  of  O(iV^). 

Various  methods  have  been  explored  to  accelerate  the  matrix- vector  mul¬ 
tiply  [4-10].  In  this  paper,  we  describe  an  algorithm  which  approximately 
performs  the  matrix-vector  multiply  of  Equation  (2)  with  opera¬ 

tions.  The  method  makes  use  of  the  two-level  algorithm  as  proposed  in 
the  original  fast  multiple  method  [4,7,8],  but  we  arrive  at  a  two  level  algo¬ 
rithm  with  lesser  complexity  compared  to  the  original  two-level  fast  multiple 
method. 

In  the  first  step,  we  divide  the  N  unknowns  into  groups  as  shown  in  Figure 
1.  The  rules  for  the  division  are  that:  (1)  each  group  should  have  roughly 
the  same  number  of  unknowns,  and  (2)  the  members  of  each  group  are  as 
close  to  each  other  as  possible.  Assuming  that  each  group  has  M  unknowns, 
the  number  of  groups  wiU  be  N/M.  In  the  following,  we  use  index  i  and  j  to 
number  unknowns,  and  use  Gi  and  Gi'  to  denote  groups  I  and  respectively. 

With  this  notation,  Equation  (2)  can  be  rewritten  as, 

N/M 

=  ieG„Z  =  1,2,  (3) 

V=1  jeGi, 
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Figure  1  The  discredited  scatterer  is  divided  into  groups. 
The  interaction  between  groups  with  large  distance  is  cal¬ 
culated  using  far  field  approximation;  the  near  field  groups’ 
interaction  is  calculated  exactly.  For  group  I,  groups  Z  -  1,  L 
and  Z  +  1  are  considered  as  near  field  groups  (NN);  For  group 
V,  it  is  considered  as  far  field  group  (FF). 


Depending  on  the  distance  ri>i,  we  define  two  neighboring  regions  for 
each  group  Gi.  One  is  the  near  field  (NN)  region,  the  other  is  the  far  field 
(FF)  region.  A  group  Gr  is  said  to  be  in  the  far  field  region  of  group  Gi 
if  the  distance  between  the  two  groups  is  larger  than  a  certain  quantity. 
Otherwise,  Gi>  is  in  the  near  field  region  of  G/.  Consequently,  the  summation 
in  Equation  (3)  can  be  spht  into  two  parts:  one  corresponding  to  the  near 
field  (NN)  contribution,  while  the  other  corresponding  to  the  far  field  (FF) 
contribution,  namely, 

y]  y]  J,FW(fcr,,)A,+  y]  y]  =  E,/{307rk), 


I'eNN  jeGif 


i  -e  Gul  - 


The  far  field  contribution,  which  is  proportional  to, 

VeFFj^Gi, 

can  be  calculated  approximately  by  first  applying  the  large  argument  approx¬ 
imation  to 


i 


krji  >  1. 


Next,  we  write 


Tji  =  Tjv  -h  Tm  +  rii. 
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Thus,  if  ri'i  rjii  and  rm  ^  ru^  then, 


Tji  ^  rvi  +  Tin  •  Vii  +  Tvi  •  TjV. 

Substituting  Equations  (6)  and  (7)  into  Equation  (5),  we  have 


2 


/.^pVTTiv^  y-iriVT^i 


2 


6n=  XI  (9) 

jeGi, 

Calculation  of  Equation  (8)  for  all  pairs  of  (/,  i)  requires  NM  +  N'^/M 
operations.  The  near  field  operation  count  is  proportional  to  NM,  and  the 
total  CPU  is 


CPU  -  CiNM  +  C2M  y—j  .  (10) 

where,  Ci  and  C2  are  constants.  Optimization  yields  M  —  Vn  and  CPU  ~ 
^1.5 

Equation  (8)  consists  of  three  steps.  The  first  step  is  called  aggregation. 
It  is  represented  by  the  summation  in  Equation  (9).  The  second  step  is 
translation,  corresponding  to, 


Finally,  the  third  step  is  disaggregation,  corresponding  to, 

hi  =  X 


The  reduction  in  operation  from  to  is  attributed  to  replacing  the 
translation  matrix  by  a  scalar  number. 

To  further  reduce  the  operation  count,  we  consider  the  aggregation  step 
more  carefully.  Computation  of  Equation  (9)  for  aU  1,1'  requires  N^/M  op¬ 
erations.  However,  we  know  that  bm  depends  only  on  ri'i  which  is  a  function 
of  an  angle  a  €  (— tt,  tt).  The  oscillation  of  6^/  with  respect  to  a  wiU  depend 
on  max(A;rjj/)  which  is  the  electrical  size  of  a  group.  Hence,  it  motivated 
us  to  compute  the  summation  in  Equation  (8)  in  two  steps.  The  first  step 
is  to  compute  it  on  equal  samples  of  a  over  [— tt,  tt],  and  then  to  compute 
bin  by  interpolating  with  the  known  values  on  the  samples.  The  number  of 
a  samples  can  be  chosen  to  be  proportional  to  the  number  of  unknowns  in 
the  group.  The  interpolation  process  is  shown  in  Figure  2.  Figure  2a  shows 
the  original  operation  which  is  the  direct  aggregation.  Figure  2b  shows  the 
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(a) 


MM  ^ 


Figure  2  (a).  Direct  aggregation,  (b).  Aggregation  using 
interpolation. 


aggregation  using  interpolation.  It  is  clear  that  if  N/M  is  larger  than  M, 
than  interpolation  procedure  is  more  efficient. 

In  order  to  reduce  the  total  computational  cost,  the  cost  for  the  disaggre¬ 
gation  stage  (Equation  (9))  should  also  be  reduced.  This  can  be  achieved  by 
smoothing  [11],  which  is  just  the  converse  of  interpolation.  In  this  manner, 
the  operation  count  or  the  CPU  ~ 

The  above  method  can  easily  be  extended  to  the  TE  polarization,  and 
also  to  the  combined  field  integral  equation. 

3,  Numerical  simulations 

In  this  Section,  we  give  numerical  examples  to  compare  the  calculated 
RCS  using  fast  far  field  approximation  (FAFFA)  with  the  result  by  CG  using 
exact  moment  matrix.  In  all  the  examples,  combined  field  integral  equation 
(CFIE)  is  used.  The  numerical  results  include  both  TM  and  TE  cases. 

Our  first  example  is  a  2D  open  ended  cavity.  The  cavity  is  of  length 
length,  width  and  thickness  10m,  5m  and  Im,  respectively.  The  polarized 
plane  wave  is  incident  at  0^  —  30°,  as  shown  in  Figure  3.  The  result  from  fast 
kr  field  approximation  agrees  well  with  the  exact  solution. 

The  second  example,  shown  in  Figure  4,  is  the  RCS  of  a  3D  90°  dihedral 
which  consists  of  two  flat  plate  of  same  size  (Im  x  20m,  20m  side  as  the 
common  edge).  Vertical  polarized  plane  wave  is  incident  from  the  direction 
of  {0\  4>^)  =  (45°,  270°)  with  a  frequency  300  MHz. 

The  above  two  figures  show  that  FAFFA  takes  into  account  of  multiple 
reflections  such  as  in  corner  reflector  and  in  cavity  like  targets.  Our  next 
example  show  that  FAFFA  will  also  takes  into  account  the  small  scale  target 
geometry  variations.  Figure  5  plots  the  RCS  of  a  two  dimensional  cylinder 
whose  cross  section  is  also  shown  in  the  figure.  The  geometry  variation  is  in 
less  than  a  wavelength  scale. 

From  Figures  3  to  5,  one  can  see  that  the  RCS  predicted  by  FAFFA  are 
very  close  to  the  exact  solution.  The  advantage  of  FAFFA  is  that  it  has  lower 
computational  complexity  (0(iV^/^))  and  small  memory  requirement  (0(iV)). 
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Hence  it  can  be  applied  to  predict  RCS  for  targets  with  very  large  electrical 
size.  Figures  6  and  7  are  two  examples. 


Figure  3  RCS  of  a  2D  open  ended  cavity. 


Angle  (DEG) 

Figure  4  The  RCS  of  a  3D  dihedral  corner  reflector. 


4.  Conclusions 

A  fast  far  field  approximation  (FAFFA)  for  RCS  calculation  of  larger  size 
targets  is  developed.  This  method  considers  near  field  and  far  field  interac¬ 
tions  differently  so  that  the  complexity  for  performing  a  matrix- vector  mul¬ 
tiplication  is  reduced  from  O(N^)  to  and  the  memory  requirement 

is  of  O(A^)  for  N  unknown  problems.  The  method  differs  from  exact  method 
since  far  field  approximation  is  used.  It  also  differs  from  pure  PO  since  the  so¬ 
lution  is  calculated  iteratively  so  that  the  higher  order  scattering  field,  such 
as  edge  diffraction,  surface  wave  and  multiple-reflection,  are  automatically 
taken  into  account.  Numerical  simulations  show  that  this  method  predicts 
RCS  which  are  very  close  to  the  exact  solution,  and  is  greatly  improved  over 
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Figure  5  The  RCS  of  a  2D  cylinder  with  zigzag  contour  of 
cross  section. 


Figure  6  The  RCS  of  a  2D  complex  target.  Plane  wave  of 
frequency  300  MHz  is  incident  from  0^  =  0"^  and  is  polarized 
in  z  direction.  N  =  52, 788. 


that  from  applying  pure  PO.  Also,  due  to  the  reduced  computational  com¬ 
plexity  and  memory  requirement,  the  method  can  be  apphed  to  calculate 
RCS  for  targets  with  very  large  electrical  size.  Om  ongoing  work  is  to  apply 
this  method  to  three  dimensional  scatterers,  which  will  be  reported. 
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Abstract 

The  Parameter  Estimation  Technique  (PET)  is  a  new 
technique  that  can  be  applied  to  numerical  codes  based 
on  dense  matrices  as  a  power  booster  for  the 
computation  of  the  response  of  the  physical  system  at 
different  frequencies,  etc.  It  is  applied  to  the  Multiple 
Multipole  Program  (MMP)  in  conjunction  with  the 
tnethod  of  Conjugate  Gradients  (CG)  for  iteratively  and 
evidently  solving  the  rectangular  MMP  matrix.  The 
PET  takes  advantage  of  the  a  priori  knowledge  obtained 
from  previous  computations  and  allows  to  a  dramatic 
reduction  of  the  computation  time  in  various  situations. 
The  computation  time  considerably  depends  on  the  PET 
basis  functions.  Although  useful  results  had  already 
been  obtained  with  simple  power  functions,  it  is  shown 
that  the  PET  can  be  drastically  improved  when  a  more 
general  basis  is  optimized  by  a  Genetic  Algorithm. 


t.  Introduction 

One  of  the  most  important  benefits  of  time  domain 
methods  such  as  the  Finite  Difference  Time  Domain 
(FDTD)  technique  [1]  comes  from  the  fact  that  the 
information  on  the  EM  field  of  the  previous  time  steps 
can  be  used  as  a  starting  value  for  the  computation  of 
the  actual  EM  field.  This  rigid  use  of  the  a  priori 
knowledge  [2,3]  allows  the  iterative  computation  of  the 
EM  field  in  each  time  step  with  typically  one  iteration 
only  -  provided  that  the  time  step  is  sufficiently  small. 
As  a  consequence,  the  method  is  efficient  and  fast. 


Today,  iterative  matrix  solvers  such  as  the  method  of 
Conjugate  Gradients  (CG)  are  most  frequently  applied 
to  large  sparse  matrices  although  the  CG  algorithm  was 
originally  designed  for  small,  dense  matrices  [4]. 
Variants  of  the  CG  have  also  been  applied  to  the 
Method  of  Moments  (MoM)  and  other  integral  methods 
[5].  Although  CG  is  very  helpful  in  many  situations,  it 
has  important  drawbacks:  Its  convergence  depends  to  a 
great  extent  on  the  choice  of  the  preconditioner  and  of 
the  starting  values.  Moreover,  CG  is  often  either  robust 
or  efficient  rather  than  robust  and  efficient  as  desired. 

The  MMP  codes  [6-8]  are  based  on  an  overdetermined 
system  of  equations  characterized  by  a  dense, 
rectangular  matrix.  The  condition  number  of  the  MMP 
matrix  is  often  very  high.  Therefore,  many  of  the  simple 
matrix  solvers  cannot  successfully  be  applied  -  except 
for  almost  trivial  problems  [9].  This  is  the  most 
important  reason  why  the  CG  matrix  solver  in  the  2D 
MMP  code  [10]  was  rarely  used  in  the  past.  It  is 
important  to  note  that  the  bad  condition  of  the  matrix 
does  not  cause  inaccurate  results.  In  contrary,  it  has 
been  shown  [2,3]  that  the  accuracy  of  the  results  can 
often  be  increased  by  increasing  the  condition  number  of 
the  matrix.  Moreover,  ill-conditioned  matrices  simplify 
the  modelling  and  are  important  for  the  user- 
friendliness  of  the  code. 

For  the  reasons  outlined  above,  iterative  MMP  matrix 
solvers  were  discarded  [10]  and  Block-Iterative 
Techniques  (BIT)  [8,11]  were  preferred  for  computing 
large  MMP  models.  MMP-BIT  is  a  sophisticated 
technique  that  can  be  applied  by  MMP  wizards  only.  It 
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is  helpful  for  solving  large  problems,  but  it  usually  does 
notallow  to  considerably  reduce  the  computation  time. 
In  inverse  scattering  and  other  optimization  problems 
where  a  large  number  of  slightly  varying  MMP  models 
must  be  computed,  one  is  interested  in  a  robust  and 
simple  method  that  allows  to  keep  the  computation  time 
as  small  as  desired  and  MMP-BIT  is  not  well  suited  for 
this  purpose. 

It  is  very  important  to  recognize  that  CG  and  other 
iterative  matrix  solvers  can  handle  ill-conditioned 
matrices.  The  main  problem  for  the  efficient  use  of  such 
techniques  is  a  sufficiently  accurate  estimation  of  the 
initial  values.  The  key  to  finding  good  starting  values  is 
governed  by  some  a  priori  knowledge  [2,3]  obtained 
from  previous  computations.  As  soon  as  the  EM  field 
has  been  computed  for  a  specific  model,  one  possesses 
some  a  priori  knowledge  for  the  computation  of  slightly 
modified  models.  This  knowledge  should  be  recycled 
rather  than  wasted.  The  main  idea  of  the  Parameter 
Estimation  Technique  (PET)  is  the  recycling  of 
knowledge.  This  technique  is  explained  in  the  following 
sections. 

The  second  problem  for  iterative  techniques  is  the 
stopping  criterion.  Unfortunately,  the  convergence  of 
the  CG  algorithm  applied  to  overdetermined  systems  of 
equations  shows  a  staircase  behaviour  and  the 
maximum  number  of  iterations  required  for  a  given 
accuracy  is  not  known  in  advance.  It  has  been  shown  in 
[12]  that  the  PET  provides  both  an  appropriate 
estimation  of  the  starting  values  and  an  appropriate 
stopping  criterion.  In  the  following,  we  will  outline  the 
PET  and  emphasize  a  further  improvement  of  this 
technique  obtained  by  an  optimization  of  the  PET  basis 
using  a  Genetic  Algorithm  [13].  Before  we  start  with  the 
description  of  the  PET,  we  have  to  resume  some  MMP 
features  that  are  relevant  for  the  PET. 


2,  Some  important  MMP  features 

In  MMP  and  other  matrix  methods,  the  EM  field  is 
defined  by  a  series  expansion  of  the  form 

K 

Field  =  X  A-  4-  Error  (1) 

k=\ 

Since  MMP  is  based  on  a  boundary  method,  the  basis 
functions  are  analytic  solutions  of  Maxwell’s  equations 
inside  the  domain  where  they  are  applied  to 


approximate  the  EM  field.  Multipole  expansions  are 
most  frequently  used  due  to  their  local  behaviour  and 
efficient  recursive  computation  [6-8]. 

The  parameters  A*  of  the  MMP  expansion  (1)  are 
determined  by  the  Generalized  Point  Matching  (GPM) 
technique  [8].  This  technique  is  relatively  simple.  It 
leads  to  an  overdetermined  system  of  equations  that  is 
solved  in  the  least  squares  sense.  Since  the  different 
equations  are  weighted,  this  method  is  also  called  the 
method  of  weighted  residuals.  The  GPM  implicitly 
minimizes  an  error  function  defined  along  the 
boundaries  of  all  domains  and  has  interesting 
advantages  when  the  condition  number  of  the  matrix  is 
large. 

The  MMP  codes  offer  several  direct  and  iterative 
methods  for  solving  the  matrix  equations.  Although  the 
Singular  Value  Decomposition  (SVD)  is  optimal  for  ill- 
conditioned  matrices,  this  technique  is  rarely  applied 
because  it  is  veiy  time-  and  memory  consuming.  In  the 
following,  SVD  has  only  been  used  for  computing  the 
condition  numbers.  Householder  transformations  and 
Cholesky  factorization  are  faster  than  the  Givens 
updating  procedure,  but  these  methods  are  rarely  used 
because  they  fail  when  the  condition  number  is  large. 
Therefore,  Givens  updating  is  the  most  frequently  used 
direct  matrix  solver  in  MMP.  Among  the  iterative 
matrix  solvers,  CG  seems  to  be  most  appropriate.  This  is 
the  first  choice  in  conjunction  with  the  PET. 

It  should  be  pointed  out  that  the  often  ill-conditioned, 
rectangular  MMP  matrices  cause  special  problems  that 
are  not  encountered  in  other  matrix  methods.  Therefore, 
it  is  assumed  that  the  application  of  the  PET  to  the 
MMP  code  is  more  demanding  than  the  application  of 
the  PET  to  other  codes. 


3.  The  Test  Examples 

In  [10]  the  EM  scattering  of  a  plane  wave  at  a 
cylindrical,  perfectly  conducting  plate  with  the  cross 
section  shown  in  Figure  1  has  been  considered  for 
testing  the  performance  of  the  CG  algorithm  in  the  2D 
MMP  code.  Although  this  example  is  very  simple,  it 
shows  all  the  main  effects.  Therefore,  it  is  well  suited  as 
a  test  case  for  the  MMP-CG-PET  implementation. 

It  is  well  known  that  the  number  and  location  of  the 
multipoles  used  for  simulating  the  scattered  EM  field  is 
critical  for  the  accuracy.  Simple  rules  for  setting  the 
multipoles  and  automatic  pole  setting  procedures  are 
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described  in  [8,15].  Here,  it  seems  to  be  reasonable  to 
set  all  multipoles  on  the  y  axis  as  shown  in  Figure  1. 
Both  the  accuracy  and  the  condition  number  of  the 
MMP  matrix  considerably  depend  on  the  number  of 
muitipoles.  A  detailed  study  of  this  dependence  has  been 
presented  in  [12].  It  has  been  shown  that  the  most 
accurate  results  are  obtained  when  the  number  of 
multipoles  is  relatively  high,  i.e.,  when  the  condition 
number  is  relatively  large. 
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Figure  1:  Cross  section  of  the  test  case.  The  incident 
EM  field  is  a  plane  wave  with  wave  vector  in  the  x 
direction  and  either  the  electric  field  or  the  magnetic 
field  in  the  y  direction. 

According  to  [14],  the  critical  points  of  a  such  model 
are  at  the  connections  of  straight  lines  and  circles.  Thus, 
the  efficiency  and  accuracy  can  considerably  be 
increased  when  off-axis  muitipoles  are  located  near  the 
critical  points.  In  the  following,  we  will  use  two 
different  MMP  models,  one  with  equally  spaced 
multipoles  on  the  axis  and  another  one  with  a  higher 
number  of  off-axis  multipoles. 

For  testing  the  generality  of  the  PET,  we  will  consider  a 
dielectric  cylinder  of  the  same  shape  as  in  Figure  1  with 
the  relative  permittivity  equal  to  four.  For  this  model, 
two  sets  of  multipoles  are  used,  one  with  muitipoles  on 
the  y  axis  for  modelling  the  scattered  field  and  one  with 
off-axis  multipoles  outside  the  cylinder  for  modelling 
the  field  inside  the  dielectric. 


3.  Parameter  Estimation  Technique  (PET) 

The  Parameter  Estimation  Technique  (PET)  is  based  on 
the  concept  of  recycling  the  information  obtained  from 
previous  computations  of  a  similar  problem.  When 
equation  (1)  holds,  this  information  is  stored  in  the 
parameters  Ak,  in  the  basis  functions  BasiSk,  and  in 
derived  quantities.  When  the  basis  functions  are  kept 
unchanged  during  several  computations,  the  parameters 
Ak  (and  all  derived  quantities)  are  functions  of  the 
variables  of  the  computations.  These  functions  are 
known  in  all  the  points  where  they  were  computed  in 
previous  runs.  This  allows  to  apply  any  extrapolation  or 
interpolation  technique  for  estimating  the  parameters  to 
be  computed  in  the  current  run.  If  the  estimation  is 
sufficiently  accurate,  one  has  excellent  starting  values 
for  iterative  techniques. 

Although  the  PET  can  be  applied  to  any  variable,  we 
consider  the  frequency  as  a  single  variable  for  reasons  of 
simplicity,  i.e.,  we  consider  the  functions  Ak(f).  At  low 
frequencies,  these  functions  are  very  simple  and  smooth. 
Therefore,  it  is  easy  to  obtain  an  accurate  estimation  of 
the  parameters  Ak(f)  when  one  already  has  computed 
Ak(f)  for  at  least  one  frequency  f  that  is  not  too  far 
away  from/.  At  the  beginning  of  the  computation  of  the 
frequency  dependence,  one  usually  has  no  knowledge  on 
the  size  of  the  parameters  for  the  first  frequency  /. 
Therefore,  one  has  no  good  starting  value  for  the  CG 
computations  of  the  parameters  Ak(f}).  Although  one  can 
start  CG  iterations  with  an  arbitrary  set  of  parameters, 
the  CG  computation  of  the  first  parameter  set  turns  out 
to  be  much  more  time  consuming  than  the  Givens 
solution.  Therefore,  it  is  reasonable  to  compute  the 
parameter  set  for  the  first  frequency  with  the  Givens 
updating  procedure.  For  all  other  frequencies  one  can 
assume  that  the  parameter  set  of  the  previous  frequency 
is  a  good  estimate  for  the  parameter  set  of  the  actual 
frequency. 

If  we  consider  the  frequency  dependence  of  the 
parameters  in  an  interesting  area  where  one  observes 
resonances  of  the  structure,  we  observe  a  very  strong 
variation  of  all  the  parameters  which  makes  the 
parameter  estimation  more  complicated.  Nonetheless, 
we  can  assume  that  the  parameter  set  of  a  given 
frequency  provides  a  useful  starting  value  for  the 
iterative  computation  of  the  parameter  set  for  a  slightly 
higher  or  lower  frequency. 

The  parameter  estimation  Akif)-Ak(fi)+ Error  is  simple 
and  crude.  It  corresponds  to  a  zero  order  extrapolation. 
Of  course,  one  can  expect  more  accurate  parameter 
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estimations  from  extrapolations  with  higher  order  power 
series  approximations  of  the  functions  For  an  n-th 
order  extrapolation  one  has 

Ak(f)  =  %atjf  ‘  +  Error ,  (2) 

;=0 

with  n  +  i  basis  functions  /  For  computing  the 
unknowns  akp  at  least  /=n+i  values  At  (fi )  must  be 
stored  With  /=n+i  one  obtains  the  following  / 

equations  for  the  n+i  unknowns: 

=A,(f,).  (3) 

J=0 

Note  that  one  has  to  solve  a  system  of  I  equations  for 
each  of  the  K  parameters  A*.  Since  the  same  basis 
functions  (fj  are  used  for  the  estimation  of  all 
parameters,  the  matrices  of  all  these  systems  of 
equations  are  identical  and  can  be  solved 
simultaneously.  Thus,  the  computation  time  for  the 
parameter  estimation  is  neghgible  -  except  for  very  high 
orders  n. 

At  the  beginning  of  the  computation  of  the  frequency 
dependence  with  n-th  order  PET,  one  uses  a  Givens 
solution.  Since  the  parameter  sets  of  n+I  frequencies 
are  required,  one  could  compute  the  first  n+1 
frequencies  with  the  direct  Givens  matrix  solver. 
According  to  our  experience,  already  a  zero  order  PET 
allows  to  save  computation  time  with  an  iterative  CG 
matrix  solver.  Therefore,  one  only  computes  the  first 
frequency  with  Givens  and  all  other  frequencies  with 
CG  based  on  a  ife-th  order  PET,  where  k  is  either  equal 
to  n  or  equal  to  the  frequency  step  number  minus  one. 

It  has  been  shown  in  [12]  that  the  first  order  PET 
allows  to  considerably  reduce  the  number  of  CG 
iterations,  but  the  second  order  PET  is  already  much 
worse.  This  disappointing  result  can  be  removed  by 
more  sophisticated  extrapolation  techniques  as  discussed 
in  [2,3].  First,  one  can  use  I>n+]  equations  for  an  n-th 
order  PET.  This  requires  the  introduction  of  appropriate 
weights  Wi  and  leads  to  the  following  overdetermined 
systems  of  equations  that  are  similar  to  the  matrix 
equations  of  the  MMP  codes: 

Svv,a,^(f,)'=w,A,(f,)+E.  (4) 

;=0 

Since  the  solution  of  an  overdetermined  system  of 
equations  depends  very  much  on  the  choice  of  the 
weights,  one  has  to  find  appropriate  weights  first.  This 
is  a  demanding  task,  because  we  would  like  to  use  the 
same  weights  for  computing  all  parameters 
simultaneously  and  because  the  frequency  dependence 
of  the  different  parameters  in  an  MMP  expansion  may 


be  very  complicated.  Nonetheless,  one  can  apply  a 
genetic  algorithm  for  finding  appropriate  weights. 
Although  we  observed  some  improvement  of  higher 
order  PET  with  appropriate  weights,  we  do  not  describe 
this  technique  in  detail  because  the  technique  described 
below  is  much  more  successful. 


4,  Optimized  PET 

The  main  problem  of  the  higher  order  PET  is  caused  by 
the  fact  that  higher  order  power  series  are  not  useful  for 
the  extrapolation  of  functions.  Therefore,  we  have  to 
replace  the  basis  functions/^  in  (2)  by  more  appropriate 
and  more  general  basis  functions: 

Ak(f)  =  ^a^jhasisj  (f  )  + Error  (5) 
;=0 

The  search  of  appropriate  basis  functions  basis/f)  is  an 
even  much  more  demanding  task  than  the  search  of 
appropriate  weights  mentioned  in  the  previous  section. 
First  of  all,  it  is  important  that  these  basis  functions 
should  be  appropriate  for  simultaneously  estimating  all 
the  parameters  of  any  MMP  expansion  for  any  MMP 
model. 

Since  the  frequency  can  vary  in  a  wide  range,  it  is 
reasonable  to  replace  /  in  (5)  by  another  variable  x  that 
is  restricted  to  the  interval  0...i.  This  should  be  done  in 
such  a  way  that  all  frequencies  fi  used  in  the  PET  are 
transformed  to  points  jc,  in  the  interval  0...1.  If  one 
traces  the  frequencies  starting  at  the  lowest  frequency  of 
the  frequency  range,  the  actual  frequency /is  the  highest 
frequency  when  the  PET  is  performed.  Therefore,  one 
maps  the  lowest  frequency  /  of  the  PET  on  Xi-0  and 
the  actual  frequency  /  on  x~l.  The  basis  functions 
basis(x)  need  to  be  known  in  the  points  Xi,  Xz,  ...X/,  and 
x=l  rather  than  in  the  entire  interval  0...1.  Therefore, 
one  could  replace  these  functions  by  simple  sets  of  /+/ 
values  bi=basis(Xi),  where  i=I,2,...I+L  On  one  hand, 
this  would  simplify  the  search  of  an  appropriate  basis, 
on  the  other  hand,  this  could  cause  problems  when 
different  spacing  of  the  points  x  in  the  interval  0...1  are 
applied.  Since  one  often  prefers  logarithmic  frequency 
scales  instead  of  linear  ones,  we  preferred  to  search 
continuous  basis  functions  rather  than  sets  of  discrete 
values. 

The  selection  of  an  appropriate  set  of  basis  functions  is  a 
typical  task  for  Genetic  Algorithms  (GAs)  [13].  GAs  are 
based  on  concepts  of  natural  evolution,  i.e.,  mutation, 
crossover,  and  selection.  GAs  optimize  bitstrings  in 
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such  a  way  that  their  fitness  is  increased.  In  our  case, 
the  bitstrings  contain  sets  of  PET  basis  functions.  The 
coding  of  the  bitstrings  is  the  most  crucial  part  of  the 
GA  user.  First  of  all,  each  of  the  admitted  PET  basis 
functions  can  be  characterized  by  several  parameters. 
Each  of  these  parameters  is  characterized  by  several 
bits.  If  one  wants  to  admit  a  large  variety  of  functions 
with  many  parameters  or  if  many  bits  are  used  for  each 
parameter,  the  bitstrings  become  very  long  and  the 
performance  of  the  GA  becomes  slow.  For  example, 
when  we  would  like  to  optimize  the  five  basis  functions 
of  a  fourth  order  PET  and  if  we  would  admit  five  real 
and  two  integer  parameters  with  32bit  per  parameter, 
the  length  of  our  bitstring  would  exceed  1000.  In  this 
case,  the  GA  would  have  to  select  the  fittest  of  more 
than  1E300  possible  bitstrings.  If  the  computation  of  the 
fimess  of  one  individual  takes  about  one  second,  the  GA 
can  check  about  3E7  bitstrings  within  one  year,  which  is 
a  very  small  fraction  of  1E300.  It  is  important  to  note, 
that  we  do  not  need  to  know  the  fittest  among  the 
possible  bitstrings.  The  GA  must  not  test  all  of  them  for 
finding  a  sufficiently  fit  bitstring.  Nonetheless,  we 
should  support  the  GA  by  making  the  bitstrings  as  short 
as  possible.  This  requires  some  knowledge  and 
experience  of  the  GA  user.  First,  we  characterized  our 
basis  functions  with  a  3  bit  integer  and  two  10  bit  real 
numbers,  i.e.,  23  bits  per  basis  function.  In  addition,  our 
bitstring  contained  eight  weights  characterized  by  10  bit 
real  numbers.  Later  we  removed  the  weights  and  used 
only  2  bits  for  the  integer  and  9  bits  for  the  real 
numbers.  We  tested  several  types  of  common  functions 
(sin,  cos,  exp,  log,  etc.)  and  combinations  of  such 
functions,  e.g.,  a*sin(b+x*x),  a*exp(-x*x/b),  where  a,b 
are  real  parameters.  Finally,  we  used  only  the  following 
functions  with  0.i<a<10.0  and  0<n<22: 

l/(l+a*x)^n, 

(Lx/a)'^n, 

2-l/(l+a*x)^n, 

2-(l-x/a)^n. 

For  running  a  GA,  we  must  give  a  precise  definition  of 
the  fitness.  Each  bitstring  of  the  GA  is  decoded  and 
gives  us  a  set  of  PET  basis  functions.  To  test,  whether 
this  set  is  useful  for  our  MMP-CG-PET  code,  we  must 
run  several  test  examples  and  we  must  compute  the 
frequency  dependence  of  these  examples.  The  total 
number  of  CG  iterations  required  is  a  measure  of  the 
fimess.  The  most  simple  definition  of  the  fimess  is 

fimess=l/toial  number  of  CG  iterations. 


This  definition  is  not  unique,  but  it  seems  that  it  plays  a 
minor  role  for  ±e  GA  performance. 

First  of  all,  we  must  specify  the  frequency  range  and  the 
MMP  models  of  our  test  cases.  Like  in  the  selection  of 
our  set  of  allowed  PET  basis  functions,  we  first  tried 
several  combinations  of  frequency  ranges  and  MMP 
models.  If  many  different  MMP  models  are  involved  in 
the  definition  of  the  fitness,  the  computation  of  the 
fitness  becomes  very  time  consuming.  Since  a  GA 
typically  requires  the  computation  of  the  fitness  of 
hundreds  of  individuals,  i.e„  bitstrings,  we  are  forced  to 
use  only  a  few,  simple  MMP  models. 

We  first  started  with  only  two  excitations  of  one  MMP 
model:  the  TE  and  the  TM  excitation  of  the  cylinder 
shown  in  Figure  1  with  9  multipoles  on  the  axis.  The 
frequency  range  was  chosen  from  30  MHz  up  to  300 
Mhz.  We  first  used  a  linear  frequency  scale  and  100 
frequency  steps.  This  allowed  us  to  find  many  PET  basis 
functions  causing  an  average  of  almost  1  CG  iteration 
per  frequency.  Since  at  least  one  CG  iteration  is 
performed  anyway,  we  decided  to  use  a  logarithmic 
frequency  scale  and  only  9  frequency  steps.  Such  a 
rough  discretization  of  the  frequency  range  causes  a 
very  bad  performance  of  the  PET  based  on  power  series. 
We  expected  that  a  genetically  optimized  basis  for  the 
PET  allows  to  reduce  the  total  number  of  iterations 
considerably,  maybe  by  a  speedup  factor  of  10.  This 
factor  is  related  to  the  first  order  unoptimized  (linear) 
PET  because  this  is  the  best  PET  that  had  been  found  in 
[12].  But  we  found  that  even  larger  factors  can  be 
obtained.  Since  a  further  reduction  of  the  number  of 
frequency  steps  makes  not  much  sense  (at  least  6 
frequencies  are  required  for  performing  a  fourth  order 
PET),  we  replaced  our  MMP  model  by  one  with  more 
unknowns  and  with  a  total  number  of  20  off-axis 
multipoles.  At  the  same  time  we  increased  the  desired 
accuracy  in  the  stopping  criterion  of  the  CG  algorithm. 
This  test  case  caused  a  total  number  of  575  CG 
iterations  when  a  first  order  (linear)  PET  was  applied. 
Since  the  minimum  number  of  CG  iterations  would  be  9 
for  9  frequency  steps,  this  allows  to  obtain  a  speedup 
factor  of  about  60. 

We  used  a  GA  for  finding  several  sets  of  basis  functions 
for  optimized  «-th  order  PET  with  n-  0,1, 2. .,4.  The 
results  were  both  surprising  and  encouraging.  The 
speedup  was  so  drastic  that  we  feared  that  the  optimized 
PET  basis  functions  are  adapted  to  our  test  case  only, 
and  that  a  much  worse  speedup  could  be  observed  when 
either  the  frequency  range  or  the  MMP  model  would  be 
changed.  Therefore,  we  applied  the  optimized  PET  to 
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three  different  MMP  models  (two  with  ideally 
conducting  cylinders  and  one  with  a  dielectric  cylinder) 
with  TE  and  TM  excitation  and  we  used  linear 
frequency  scaling  in  addition  to  the  logarithmic  scaling. 
Moreover,  we  traced  the  frequency  backwards,  i.e., 
starting  with  the  highest  frequency. 

Some  of  the  PET  basis  functions  that  had  been 
optimized  with  the  GA,  required  much  more  CG 
iterations  (especially  when  the  frequency  was  traced 
backwards),  but  many  of  them  turned  out  to  be  very 
robust.  The  speedup  for  the  more  complicated  MMP 
model  (dielectric  cylinder)  was  even  considerably  higher 
than  for  the  model  that  had  been  used  in  the  GA 
optimization  procedure.  Therefore,  we  are  confident  that 
these  optimized  basis  functions  will  be  useful  for  any 
MMP  model. 

It  should  be  pointed  out  that  we  found  our  robust 
optimized  PET  bases  with  an  evolutionary  procedure  on 
three  levels:  On  the  first  level,  we  provided  a  set  of  basis 
functions  that  seemed  to  be  reasonable  to  us.  This  was  a 
selection  based  on  our  ‘a  priori’  knowledge  and  on  our 
experience.  On  the  second  level,  the  GA  selected  the 
most  appropriate  basis  functions  for  a  given  test  case; 
i.e.,  the  GA  optimized  the  parameters  characterizing  the 
basis  functions  provided  on  the  first  level.  On  the  third 
level,  we  applied  the  best  basis  functions  found  on  the 
second  level,  to  a  larger  number  of  test  cases  and  we 
selected  the  most  robust  sets  of  basis  functions. 

The  best  set  that  has  been  found  up  to  now  is  much 
better  than  we  had  ever  hoped,  but  it  is  certainly  not 
optimal.  Therefore,  the  time-consuming  search  for 
better  solutions  goes  on  and  the  MMP  users  can  take 
advantage  of  a  code  with  an  excellent  performance  that 
is  further  improved  when  a  better  PET  basis  is  found  by 
our  PET-optimizer  running  in  the  background. 


5.  Results 

For  optimizing  the  PET  basis  functions,  we  used  three 
different  MMP  models  of  the  “2D  ACES  cylinder”  (see 
Figure  1  and  Table  1).  All  models  were  excited  with  a 
TE  and  a  TM  plane  wave  travelling  in  x  direction. 

The  frequency  range  from  30MHz-300MHz  was 
scanned  with  very  rough  steps.  Only  10  values  were 
computed  for  the  entire  decade.  Four  different  frequency 
models  were  considered:  1)  logarithmic  upward,  2) 
logarithmic  downward,  3)  linear  upward,  4)  linear 


downward.  These  rough  frequency  models  cause  a  huge 
number  of  CG  iterations  when  the  simple  linear  PET  is 
applied.  When  one  runs  downward,  one  starts  in  an 
area  close  to  a  resonance  of  the  structure  where  the 
parameters  of  the  MMP  expansion  vary  rapidly.  In  this 
case,  the  CG  convergence  is  so  slow  that  considerably 
more  iterations  than  unknowns  are  required.  More  than 
10’ 000  CG  iterations  were  required  for  computing  a 
single  frequency  of  the  third  MMP  model.  Since  the 
direct  Givens  solution  is  more  efficient  than  the  CG 
solution  when  more  than  about  50  CG  iterations  are 
required  for  this  model,  it  seems  to  be  hopeless  to  find 
PET  basis  functions  that  allow  such  a  drastic  reduction 
of  the  CG  iterations.  As  one  can  see  from  Table  2,  we 
found  even  zero  order  PET  solutions  with  an  average  of 
less  than  7  CG  iterations  per  frequency. 


Material 

Expansion 

Matrix 

ideal  conductor 

9  multipoles  on  y-axis 

164*75 

ideal  conductor 

20  off-axis  multipoles 

164*49 

dielectric,  £r==4 

5  multipoles  on  y-axis 
+S  off-axis  multipoles 

144*95 

Table  1:  Expansion  and  matrix  size  of  the  three  MMP 
models  of  the  *‘2D  ACES  cylinder*'  used  for  the  PET 
optimization. 


Especially  the  CG  computation  of  the  second  frequency 
of  a  frequency  plot  gives  rise  to  a  problem.  Here,  one 
has  only  one  set  of  parameters  that  is  known  from  the 
initial  Givens  solution  of  the  first  frequency.  Therefore, 
one  can  only  use  a  zero  order  PET  with  a  single  basis 
function.  Since  one  does  not  know  whether  the 
parameter  to  be  estimate  is  increasing  or  decreasing,  it 
seems  to  be  reasonable  to  use  a  constant  as  the  first  PET 
basis  function.  It  may  be  surprising,  that  the  first  basis 
function  in  Table  1  is  much  better  than  that.  When  the 
third  MMP  model  is  computed  with  this  PET  basis 
function  instead  of  a  constant  one,  only  a  few  CG 
iterations  are  required  instead  of  thousands.  In  the 
interval  x=0...1,  the  first  PET  function  decays  very 
rapidly  from  one  to  zero.  It  is  important  to  note  that  the 
first  basis  functions  for  all  optimized  higher  order  PET 
are  similar.  The  optimized  zero  order  PET  estimates  the 
MMP  parameters  of  a  frequency  by  a  multiplication  of 
the  MMP  parameters  of  the  previous  frequency  with  a 
very  small  factor  of  about  lOE-11.  It  seems  that  it  is 
good  to  start  the  CG  with  such  an  estimate  of  the  MMP 
parameters.  This  is  surprising,  because  setting  all  MMP 
parameters  equal  to  zero  is  not  a  good  starting  point  for 
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the  CG  algorithm  in  the  MMP  code.  Maybe,  the  strange 
optimized  zero  order  PET  basis  functions  can  be 
explained  by  the  bad  condition  of  the  MMP  matrices. 


Order 

PET  basis  functions 

Iterations 

0 

l/(l+2.6185897*x)**20 

6.72 

1 

l/(l+7.927(X)93*x)**12 

2-l/(l+8.8181974*x)=^*8 

6.13 

2 

l/(l+9.7093954*x)**13 

2-l/(l+3.2772961*x)**3 

(l-x/3.1223116)**3 

5.806 

3 

l/(l+8.9344432’^x)^=^18 

2-l/(l+8.9344432=^x)**18 

2-{l-x/5.9121316)**2 

2-(l-x/5.5246555)**19 

5.815 

4 

l/(l44.0909969*x)**18 

2-l/(l+3.4710391=^x)**6 

(i-x/6.0283774)**17 

l/(l+3.8197666*x)**20 

2-l/(l+8.5469671*x)**12 

5,889 

Table  2:  some  of  the  preliminary 

'optimaV  PET  basis 

functions  for  the  2D  MMP  found  before  January  6, 
1995.  The  iterations  indicated  are  the  average  of  CG 
iterations  required  for  one  frequency  step  of  our  test 
cases  with  the  rough  frequency  steps.  The 
corresponding  number  of  CG  iterations  with  a  linear 
(first  order)  PET  was  not  explicitly  computed  because 
more  than  10000  iterations  were  required  for  a  single 
frequency  step  of  some  test  cases.  The  basis  functions 
for  the  PET  orders  0...3  are  plotted  in  Figure  2. 


It  is  important  to  recognize  that  the  optimized  second 
order  PET  is  better  than  the  optimized  first  and  zero 
order  PET,  whereas  the  unoptimized  first  order  PET 
was  better  than  any  unoptimized  higher  order  PET 
(based  on  power  series).  However,  the  optimized  third 
order  PET  is  slightly  slower  than  the  optimized  second 
order  PET.  It  seems  that  this  is  caused  by  the  fact  that 
the  corresponding  GA  search  is  slower,  i.e.,  the  higher 
order  PET  that  have  been  found  up  to  now  are  further 
away  from  the  optimum.  Although  it  is  very  likely  that 
we  will  find  more  efficient  higher  order  PET  basis 
functions,  we  cannot  expect  these  solutions  to  be  much 
more  efficient  than  lower  order  PET,  when  we  continue 
with  large  frequency  steps  for  the  following  reason.  For 
a  fourth  order  PET  we  need  five  previous  frequencies  for 
estimating  the  MMP  parameters,  i.e.,  the  first  frequency 
used  in  the  PET  is  about  half  a  decade  away  from  the 
actual  frequency.  In  our  third  MMP  model  we  have 


more  than  one  resonance  within  half  a  decade. 
Therefore,  we  do  not  assume  that  the  information  in  the 
first  frequency  allows  to  considerably  improve  the 
estimate  of  the  parameters. 


6.  Conclusions 

The  optimized  PET  presented  in  this  paper  is  a  very 
powerful  and  promising  new  technique  that  can  be 
applied  as  a  power  booster  to  many  numerical  codes.  It 
takes  advantage  of  the  knowledge  obtained  from 
previous  computations  and  allows  to  replace  direct 
matrix  solvers  by  efficient  iterative  matrix  solvers  even 
when  the  condition  of  the  matrix  is  so  bad  that  the 
iterative  matrix  solvers  would  fail  without  the  PET. 

The  optimization  of  the  appropriate  basis  functions  for 
the  PET  is  extremely  time  consuming  and  difficult. 
With  the  help  of  a  genetic  algorithm  we  were  able  to 
find  promising  basis  functions  for  the  PET  that  allowed 
to  drastically  reduce  the  number  of  CG  iterations  for  all 
of  our  test  cases.  Since  we  found  these  basis  functions 
within  a  few  days  (during  our  Christmas  holidays)  on  a 
DEC  Alpha  PC,  we  are  confident  to  find  even  better 
basis  functions  in  the  near  future,  i.e.,  we  consider  the 
results  presented  in  this  paper  to  be  intermediate. 


Figure  2:  Some  of  the  optimized  PET  basis  functions 
bnm,  where  n  indicates  the  PET  order  and  m  indicates 
the  number  of  the  basis  function.  Note  that  one  has  n+1 
basis  functions  for  an  n-th  order  PET,  i.e.,  m=^l...n+l. 
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The  PET  optimization  is  an  almost  endless  job  that  can 
be  performed  very  well  in  the  background  -  with  a  low 
priority.  The  intermediate  results  of  the  PET 
optimization  reduce  the  computation  lime  of  MMP 
compulations.  Therefore,  our  MMP  code  with  the  PET 
optimization  in  the  background  behaves  like  a  code  that 
is  continuously  learning  to  faster  compute  MMP  models. 

It  is  important  to  note  that  both  the  PET  itself  and  the 
PET  optimization  can  be  applied  not  only  to  MMP  but 
also  to  any  code  based  on  a  matrix  method  with  a 
sufficiently  dense  matrix.  Since  the  application  of  CG 
matrix  solvers  to  2D  MMP  (with  its  ill-conditioned, 
rectangular  matrices)  is  relatively  hard  and  requires  a 
well  optimized  PET,  we  expect  that  this  technique  can 
easily  be  adapted  to  other  codes.  It  should  also  be 
pointed  out  that  the  PET  is  neither  restricted  to  the 
computation  of  frequency  dependeces  nor  to 
computational  electromagnetics.  We  assume  that  it  will 
be  one  of  the  most  important  instruments  for  a 
successful  and  efficient  MMP  solution  of  inverse 
scattering  and  similar  tasks. 
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I.  Introduction 

A  major  emphasis  within  the  computational  electromagnetics  (CEM)  community  concerns  the  solution 
of  Maxwell’s  differential  equations  using  finite- difference  time-domain  (FDTD)  techniques.  These  algo¬ 
rithms  generally  require  considerable  computational  time  and  storage  resources,  and  as  a  result  their 
application  to  very  large  problems  has  been  somewhat  limited.  New  eflforts  to  alleviate  these  compu¬ 
tational  obstacles  have  focussed  on  the  use  of  parallelism  in  the  FDTD  solutions.  Traditionally,  these 
algorithms  use  spatial  domain  decomposition  or  “space-parallelism’  in  w’hich  each  processor  performs 
the  differencing  in  a  given  spatial  region  at  each  time  step.  For  many  of  these  schemes,  howe\er,  com¬ 
munication  and  synchronization  requirements  tend  to  limit  the  computational  speed-up  enabled. 

This  limited  potential  for  enhanced  computational  efficiency  impbes  that  if  fuU  exploitation  of  the 
capabilities  of  emerging  multiple  instruction  multiple  data  (MIMD)  architectures  is  to  be  realized,  ap¬ 
proaches  must  be  developed  which  represent  a  drastic  departure  from  traditional  FDTD  techniques. 
The  aim  of  this  paper  is  to  present  such  a  computational  strategy.  Unlike  traditional  approaches  which 
rely  solely  on  space-paraUel  solutions,  this  methodology  exploits  parallelism  in  the  temporal  dimension 
as^vell  [1].  This  “time- parallelism”  is  achieved  by  using  the  decoupling  mechanism  of  an  eigenvalue- 
eigenvector  (EE)  decomposition  of  the  FDTD  matrix  to  diagonalize  the  time-stepping  matrix  equation. 
Parallelism  can  then  be  used  to  efficiently  solve  the  resulting  diagonal  linear  recurrence.  The  algorithm  is 
highly  coarse  grain  and  has  minimum  communication  and  synchronization  requirements,  and  is  therefore 
suitable  for  implementation  on  emerging  MIMD  machines.  Preliminary  practical  implementation  of  the 
algorithm  show  that  large  computational  speed-up  for  a  given  size  problem  can  be  achieved. 

This  paper  provides  the  theoretical  and  computational  bases  for  the  time-paraUel  solution  approach 
through  its  application  to  the  Crank- Nicolson  (CN)  solution  of  the  time-domain  scalar  wave  equation. 
The  algorithm  realm  of  applicability  is  discussed,  and  the  classes  of  problems  for  which  the  technique 
provides  results  superior  to  those  of  other  techniques  are  highlighted.  The  incorporation  of  absorbing 
boundary  conditions  into  the  algorithm  is  also  demonstrated.  The  results  of  practical  implementations 
of  the  algorithm  on  the  Intel  Touchstone  Delta  supercomputer  are  provided  for  several  representative 
configurations. 
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II.  Theoretical  Formulation 


A.  CN  Discretization  of  the  Wave  Equation 


An  understanding  of  the  use  of  time-parallelism  in  finite-difference  solutions  of  partial  differential  equa¬ 
tions  (PDEs)  is  perhaps  best  gained  by  applying  the  algorithm  to  a  familiar  problem.  To  this  end, 
consider  a  general  form  of  the  scalar  wave  equation  in  a  homogeneous  medium.  This  PDE  can  be  written 


t) 


trcT/—  fen,  o<t<r 


where  O  is  a  bounded  domain  with  boundary  and  the  initial  and  boundary  conditions  are  specified. 
The  constants  c,  cr,  and  rj  represent  the  speed  of  light,  medium  conductivity,  and  material  intrinsic 
impedance  respectively.  Let  the  vector  represent  a  spatial  ordering  of  the  discretized  field  where 
the  superscript  denotes  the  time  index  {t  =  mAt).  With  this  notation,  the  CN  discretization  of  Eq.  (1) 
can  be  written  as  a  matrix  equation  which  assumes  the  form 


[(1  -f  7)1  -  oA]  -  [(1  +  7)1  -  aA]  -f  (2) 


for  1  <  m  <  M  —  1  where 


I  is  the  unit  matrix 

A  is  the  matrix  arising  from  the  discretization  of  the  Laplace  operator 
is  the  spatially- ordered  solution  vector  at  the  mth  time  step 
is  a  source  vector 

a  =  c^At^/2  and  7  =  acr]At/2  are  constants 
M  =  T/At,  where  At  is  the  time  step  size 
and  are  the  given  initial  conditions 
N  represents  the  spatial  grid  size. 


The  linear  matrix  recurrence  given  in  Eq.  (2)  provides  a  basis  for  discussing  the  mathematical  im¬ 
plications  of  space-  and  time-parallelism.  With  reference  to  Eq.  (2),  space- parallelism  generally  refers  to 
performing  the  matrix  operations  in  a  parallel  fashion  at  each  time  step.  In  contrast,  time-paraUelism 
refers  to  a  parallel  solution  of  the  linear  recurrence  such  that  part  or  all  of  the  time  steps  are  simultane¬ 
ously  computed.  The  goal  of  this  work  is  to  allow  a  combination  of  the  two  types  of  parallelism  in  order 
to  obtain  a  coarse-grain  algorithm  which  is  suitable  for  emerging  MIMD  architectures. 

It  is  noteworthy  that  the  CN  discretization  is  an  implicit  method  which  offers  unconditional  stability 
and  therefore  reduces  the  number  of  time  steps  required  to  complete  a  given  computation.  However,  this 
savings  comes  at  the  cost  of  a  linear  system  solution  at  each  time  step,  which  can  be  very  expensive  in 
comparison  to  the  solution  of  an  explicit  time-stepping  procedure. 


B.  Time-Parallelism 

To  motivate  the  idea  of  time- parallelism,  notice  that  Eq,  (2)  simply  represents  a  Second-Order  Inhomo¬ 
geneous  Linear  Recurrence  (SOILR)  which  can  be  cast  into  a  first- order  form.  This  recurrence  solution 
can  be  fully  parallelized  and  computed  in  ^(logjM)  by  using  the  Recursive  Doubling  Algorithm  (RDA) 
[2]  or  Cyclic  Reduction  Algorithm  (CRA)  [3].  Unfortunately,  such  an  approach  is  not  computationally 
practical  since  both  RDA  and  CRA  compute  powers  of  the  matrices  in  Eq.  (2),  resulting  in  increasingly 
dense  and  ultimately  fuU  matrices.  Nevertheless,  this  observation  indicates  that,  insofar  as  the  data 
dependency  in  the  computation  is  concerned,  the  time-stepping  procedures  can  be  fully  parallelized  in 
time. 
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The  goal  of  the  time- parallel  algorithm  is  therefore  to  manipulate  Eq.  (2)  to  allow  efficient  par¬ 
allelization  of  the  linear  recurrence  solution.  To  see  this,  consider  the  Eigenvalue/ Eigenvector  (EE) 
decomposition  of  the  nonsingular  matrix  A  given  by 

A  =  0A0’^  (3) 

where  0  is  the  set  of  eigenvectors  and  A  is  a  diagonal  matrix  representing  the  set  of  eigenvalues  of  A. 
Substituting  Eq.  (3)  into  Eq.  (2)  and  taking  0  and  0“^  outside  the  parentheses  gives  the  expression 

0[(1  +  7,)I-«A]0-V(’"+'>  =  2t^W-0[(l  +  7)I-aA]0-V<’”-’’ 

+  +  /(“+'’.  (4) 

Defining  the  diagonal  matrix  S  =  [(1  +  7)!  -  «A]-i  and  multiplying  both  sides  of  Eq.  (4)  by  the 
nonsingular  matrix  0"^  gives 

Defining  the  vectors  and  =  30-^^"^^  allows  us  to  write  Eq.  (5)  as 

^(m+i)  ^  -  (I  -  (6) 

In  contrast  to  Eq.  (2),  Eq.  (6)  is  diagonalized  and  can  be  efficiently  solved  in  parallel  using  RDA  or  CRA. 
For  two  and  three  dimensional  problems,  Eq.  (6)  can  be  computed  in  0(A^log2  M)  and  0(A  log2  M) 
respectively  by  using  0(M)  processors. 

C.  Algorithm  Structure 

To  faciUtate  the  discussion  of  the  time-paraUel  algorithm,  we  subdivide  it  into  four  steps  as  lUustrated 
in  Figure  1.  Step  1  involves  determining  the  EE  decomposition  of  the  matrix  A  and  forming  the  matrix 
S.  In  general,  since  a  standard  EE  decomposition  routine  returns  only  ©  and  A,  it  is  necessary  to 
expUcitlv  compute  ©“^  by  computing  the  EE  decomposition  of  A^  =  (0  ^)^A0  .  Note  that  this  can 
be  performed  in  paraUel  with  the  EE  decomposition  of  A.  The  source  vectors  are  computed  and 
multiplied  by  S0“^  in  Step  2  to  obtain  the  vectors  The  parallel  solution  of  the  SOILR  in  Eq.  (6) 

is  accompUshed  in  Step  3.  FinaUy,  the  solution  vectors  are  obtained  in  Step  4  by  performing  the 

matrix- vector  multiplication  0^^^^,  j  j  -u 

It  is  noteworthy  that  the  computations  in  Steps  2  and  4  are  completely  decoupled  and  can  be  per¬ 
formed  in  parallel  with  no  inter-processor  communication.  Step  3  does  involve  communication,  but 
this  data  transfer  has  a  rather  simple  structure  and  can  be  efficiently  implemented  on  MIMD  paraUel 
architectures  since  it  involves  the  exchange  of  large  vectors.  ^  . 

From  a  standpoint  of  computational  complexity,  it  can  be  shown  that  the  matrix- vector  multiplications 
of  Steps  2  and  4  and  the  EE  decompositions  in  Step  1  have  the  potential  to  dominate  the  overall 
algorithmic  computational  cost.  In  light  of  this,  two  key  requirements  for  the  efficient  application  of  the 
algorithm  are 

1.  an  efficient  scheme  for  determining  the  eigenpairs  of  A,  and 

2.  an  efficient  scheme  for  the  matrix- vector  multiplications  in  Steps  2  and  4. 

The  second  issue  is  particularly  important  since  multiplication  of  a  dense  matrix  by  a  vector  leads  to 
a  computational  cost  of  and  0{N^)  for  two  and  three  dimensional  problems  respectively.  This 

fact,  coupled  with  the  first  issue,  motivates  the  exploitation  of  the  sparse  structure  of  A  and  the  use  of 
factored  forms  of  0  and  0“^  to  increase  the  efficiency  of  these  steps. 


594 


Step  1 


Step  2  Step  3  Step  4 


Figure  1:  Flow  diagram  illustrating  the  computational  structure  of  the  time-parallel  algorithm. 

D.  Parallel  Recurrence  Solution 

As  mentioned,  the  parallel  solution  of  Eq.  (6)  can  be  accomplished  using  a  conventional  method  such  as 
RDA  or  CRA.  However,  because  of  the  decoupled  nature  of  this  equation,  it  is  also  possible  to  exploit  both 
time-  and  space-parallelism  to  perform  the  solution  using  a  pipelining  scheme.  Preliminary  numerical 
implementations  performed  by  the  authors  have  shown  that  such  a  scheme  can  be  more  efficient  than  the 
RDA  for  implementations  where  the  number  of  time  steps  M  is  significantly  larger  than  the  number  of 
available  processors  ( J).  This  pipelining  scheme  is  depicted  in  Figure  2.  In  this  algorithm,  processor 
initially  computes  the  Mj J  source  vectors  for  (P~  l)Af/J  +  1  <  <  PMj J.  The  matrix  recurrence 

is  then  divided  into  J  blocks  each  containing  T/ J  equations,  where  L  is  the  total  number  of  equations  in 
the  system.  Processor  #1  computes  the  recurrence  for  the  first  block  for  the  first  MjJ  time  steps  and 
passes  the  result  to  processor  #2  before  working  on  the  second  block.  Processor  #2  then  receives  that 
data,  performs  the  next  Mj  J  steps  in  the  recurrence  for  that  block  while  processor  #1  is  computing  the 
recurrence  for  the  second  block,  and  passes  the  result  to  processor  9^3.  This  process  is  repeated  until  the 
recurrence  is  complete. 

E.  Absorbing  Boundary  Conditions 

For  many  problems,  particularly  enclosed  problems  involving  resonant  cavities  or  wave-guiding  structures, 
the  time-parallel  algorithm  may  be  readily  apphed  since  typically  either  a  Dirichlet  or  Neumann  condition 
is  enforced  on  the  outer  spatial  domain  boundary.  However  for  open  problems  encountered  in  radiation 
and  scattering,  the  issue  of  incorporating  an  absorbing  boundary  condition  (ABC)  [4]  at  the  outer  domain 
boundary  must  be  addressed.  Without  proper  termination  of  the  grid,  outgoing  waves  in  the  domain  wiU 
be  artificially  reflected  by  the  grid  truncation.  For  the  time-parallel  scheme,  proper  choice  of  an  ABC 
is  of  critical  importance.  This  arises  because  the  algorithm  derivation  depends  upon  the  fundamental 
assumption  that  all  matrices  in  Eq.  (2)  are  simultaneously  diagonalizable.  While  this  assumption  holds 
for  conventional  boundary  conditions  (i.e.  Dirichlet,  Neumann,  Mixed,  and  Periodic),  it  is  not  necessarily 
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Figure  2:  Pictorial  description  of  the  pipelined  time-  and  space- parallel  SOILR  solution. 


valid  for  aU  existing  forms  of  ABCs. 

The  method  for  incorporating  ABC's  into  the  algorithm  depends  somewhat  upon  the  geometry  under 
investigation.  However,  in  general,  the  goal  is  to  simultaneously  reduce  the  matrix  equation  to  a  form 
amenable  for  parallel  computation.  For  the  case  of  circularly  cylindrical  structures  examined  below,  it 
is  found  that  the  CN  matrix  equation  can  be  reduced  to  a  block- diagonal  matrix  where  the  blocks  are 
tridiagonal.  Such  a  system  can  be  efRciently  solved  using  the  pipelining  SOILR  solution  discussed  above. 


III.  Application  to  Circular  Cylindrical  Geometries 

The  geometry  to  which  the  time-parallel  algorithm  will  be  applied  is  the  circular  cylindrical  geometry 
depicted  in  Figure  3.  In  this  figure,  the  inner  cylinder  boundary  at  Po  is  considered  a  perfect  electric 
conductor  while  the  outer  cylinder  boundary  will  be  either  a  conductor  or  an  ABC.  These  two  geometries 
will  be  used  to  compute  the  fields  inside  a  coaxial  cavity  as  weU  as  the  plane-wave  scattering  from  a 
cylindrical  structure.  It  will  be  assumed  that  in  all  computations  the  fields  are  TM^. 

Application  of  the  time-parallel  algorithm  requires  first  a  knowledge  of  the  EE  decomposition  for  the 
matrix  A.  Because  this  derivation  is  somewhat  involved,  only  the  final  results  will  be  presented  here. 
Those  interested  in  specific  details  should  consult  Reference  [5]. 

A.  Conducting  Outer  Surface 

When  a  conductor  is  placed  at  the  outer  domain  contour,  the  boundary  condition  on  the  ^-polarized 
electric  field  will  be  of  the  homogeneous  Dirichlet  type.  For  this  configuration,  with  N  azimuthal  and  K 
radial  discretization  points  in  the  spatial  grid,  the  EE  decomposition  of  the  matrix  A  can  be  shown  to 

&  =  TVQ  (0 

where 

Q  =  Diag{Qj}  €  l<j<N 

V  6  is  a  permutation  matrix 

-  Diag{F}  6  ^  where  F  is  a  A  x  N  one-dimensional  discrete  Fourier  transform  (DFT). 

Therefore,  multiplying  a  vector  by  the  matrix  ©  is  tantamount  to  multiplying  by  a  block 
diagonal  matrix  with  K  x  K  blocks,  performing  a  permutation,  and  performing  K  DFT’s  of  length  N. 
It  should  also  be  mentioned  that  computing  the  matrix  ©  amounts  to  finding  the  N  matrices  Qj  by 
computing  the  EE  decomposition  of  N  K  x  K  sign-symmetric  matrices.  This  operation  can  be  performed 
in  parallel. 
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Pi  =  (p+l)Ap 


Figure  3:  Geometry  and  gridding  for  the  CN  solution  of  Maxwell’s  equations  near  a  perfectly  conducting 
circular  cylinder. 

B.  ABC  at  Outer  Surface 

When  the  outer  domain  boundary  is  an  ABC,  the  resulting  CN  matrix  equation  assumes  the  general 
form 

(I  -  -  (I  -  (8) 

where  V  is  a  diagonal  matrix.  It  can  be  shown  that  the  matrix  A  is  reducible  to  a  convenient  form  using 
the  de^mposition  A  =  0oL0J^  where  0^  =  !FV  and  L  is  block  diagonal,  L  =  Diag{Lj},  1  <  j  <  N , 
where  is  tridiagonal.  Using  this  decomposition  in  Eq.  (8)  results  in  the  equation 

(I  -  =  2^^^^  -  (I  -  (9) 

where  and  V  remains  diagonal  but  has  been  transformed  by  a 

permutation.  Defining  =  ^(’^+1)  +  and  allows 

expression  of  Eq.  (9)  as 

=  (10) 

Because  of  the  block  diagonal  structure  of  L,  the  linear  system  solution  in  Eq.  (10)  is  equivalent  to 
N  decoupled  tridiagonal  system  solutions.  This  decoupled  form  allows  the  recurrence  solution  to  be 
performed  using  the  pipelining  method. 


IV*  Results  of  Algorithm  Implementation 

In  this  section,  the  results  of  applying  the  algorithm  to  the  circular  cylindrical  geometry  on  the  Touchstone 
Delta  supercomputer  wiU  be  presented.  In  all  of  the  computations,  the  parameters  are  M  =  512  =  #  time 
steps,  K  =  32  =  of  radial  grid  points,  and  Ap  =  0.1  A  =  radial  discretization  size,  and  />i  —  po  =  3A. 
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Speedup  With  Reference  To: 

—  Best  Sequential  —  Time -Parallel 


Speedup  With  Reference  To: 

Best  Sequential  Algorithm 


Figure  4:  Computational  speedup  versus  the  number  of  processors  used  to  solve  for  the  fields  in  the 
coaxial  cylinder  using  the  time-parallel  approach  with  pipelining  in  the  SOILR:  (a)  single  computation; 
(b)  64  computations  with  different  source  configurations  for  the  same  geometry.  Case  1:  po  =  1-6A;  Case 
2:  po  =  16A. 


A.  Closed  Coaxial  Cavity 

As  a  first  example,  let  us  consider  the  closed  coaxial  cavity  configuration  such  that  a  Dirichlet  boundary 
condition  is  used  on  the  outer  computational  domain  boundary.  The  inset  of  Figure  4  illustrates  the 
geometry  for  the  problem  and  lists  the  dimensions  for  the  two  cylinders  under  investigation.  The  exci¬ 
tation  is  uniform  about  the  interior  cavity  perimeter.  The  azimuthal  discretization  uses  N  =  128  and  N 
=  1024  for  Cases  1  and  2  respectively. 

Figure  4(a)  illustrates  the  speedup  achieved  as  the  number  of  processors  assigned  to  the  problem 
increases.  The  “IDEAL  SPEEDUP”  curve  suggests  that  in  the  absence  of  communication  and  syn¬ 
chronization  costs,  the  speedup  equals  the  number  of  processors  used.  In  reality,  however,  the  achieved 
speedup  is  less  than  ideal,  as  expected.  The  speedup  is  measured  with  reference  to  two  different  sequential 
computation  times.  The  “Best  Sequential”  time  Tf  ^  for  solution  of  this  problem  corresponds  to  using 
the  Fast  Poisson  Solver  [6]  for  the  linear  system  solution.  The  “Time-ParaUel”  time  represents  the 
time  required  to  perform  the  algorithm  introduced  here  on  a  single  processor.  In  all  cases,  a  very  good 
speedup  is  achieved  using  the  time-parallel  approach.  It  is  noteworthy  that  the  computational  speedup 
improves  as  the  problem  size  increases  due  to  a  corresponding  increase  in  the  computational  grain  size. 

Close  investigation  of  the  relative  times  required  to  complete  each  of  the  steps  in  Figure  1  reveals  that 
Steps  1  and  3  represent  a  significant  portion  of  the  algorithm  computational  time.  This  occurs  because 
of  the  communication  overhead  associated  with  these  steps.  It  is  noteworthy  that  if  the  methodology  is 
applied  to  problems  for  which  the  EE  decomposition  is  known  explicitly,  then  the  time  devoted  to  Step 
1  is  reduced  to  an  almost  negligible  level.  For  the  problem  at  hand,  it  becomes  evident  that  the  time- 
parallel  algorithm  will  perform  particularly  well  for  problems  in  which  the  same  geometry  is  simulated  for 
many  different  source  configurations.  In  such  a  situation,  the  EE  decomposition  needs  to  be  computed 
only  once  for  all  of  the  CN  solutions.  As  an  example,  consider  that  the  source  in  the  cavity  is  non-zero 
only  over  a  specified  azimuthal  region,  and  it  is  desired  to  determine  the  region  size  for  which  optimal 
coupling  into  the  cavity  occurs.  The  speedup  of  the  algorithm  assuming  that  64  different  source  region 
sizes  are  used  is  provided  in  Figure  4(b).  These  results  show  significantly  improved  algorithmic  speedup, 
as  expected.  This  demonstrates  the  importance  of  minimizing  the  relative  computation  time  of  the  EE 
decomposition  in  order  to  obtain  an  algorithm  with  high  speedup  for  a  fixed  problem  size. 
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Speedup  With  Reference  To: 

Best  Sequential  Algorithm 


Case  1 

Po  =  1 .6X, 

Case  2 

Po  =  1 6  A, 


Figure  5:  Computational  speedup  versus  the  number  of  processors  used  to  solve  for  the  fields  scattered 
by  a  cylinder  using  the  time- parallel  approach  with  pipelining  in  the  SOILR. 

B.  Scattering  from  a  Circular  Cylinder 

As  a  second  example,  consider  the  computation  of  scattering  from  a  circular  cylinder  where  an  ABC 
must  be  placed  at  the  outer  domain  boundary.  Figure  5  illustrates  the  geometry  and  computational 
speedup  achieved  in  this  investigation.  The  azimuthal  grid  sizes  used  are  identical  to  those  used  for  the 
computation  in  Figure  4.  Here  again,  the  speedup  results  show  that  the  computational  grain  size  is  larger 
for  the  larger  problem,  resulting  in  an  improved  speedup  result.  It  is  also  interesting  that  the  speedup 
performance  is  very  similar  to  that  seen  in  Figure  4(a)  where  a  purely  diagonal  system  was  obtained.  It 
would  appear  from  this  simple  study  therefore  that  for  this  configuration,  the  use  of  the  block  diagonal 
form  does  not  severely  degrade  the  achievable  computational  speedup, 

V.  Conclusions 

In  this  paper,  a  novel  approach  for  the  time-  and  space- parallel  solution  of  FDTD  algorithms  has  been 
presented.  The  theoretical  and  computational  bases  of  the  scheme  have  been  detailed,  and  its  practical 
application  on  a  MIMD  architecture  has  been  demonstrated. 
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Abstract  —  A  technique  is  presented  to  reduce  the  computation  time  for  the  evaluation  of 
the  impedance  matrix  for  Method  of  Moments  (MoM)  discretization  of  the  integral  equation 
formulation  of  antenna  and  scattering  problems.  This  technique,  called  “matrix  synthesis  , 
requires  that  only  a  fraction  of  the  matrix  entries  be  computed  exactly,  while  the  rest  is  derived 
through  an  approximation  scheme.  Here  it  is  described  for  printed  structures  on  plane-stratified 
media,  and  applications  to  microstrip  structures  are  presented. 


Introduction 

We  consider  here  the  full-wave,  standard  integral-equation  formulation  for  antenna  or  scattering 
problems,  discretized  according  to  the  Method  of  Moments  (MoM),  so  that  N  unknowns  are 
obtained.  The  computation  time  Tc  is  the  sum  of  the  time  Tj  necessary  to  fill  in  the  matrix 
(filling  time),  and  the  time  required  for  the  factorization  (or  inversion)  of  the  linear  system. 
Although  typical  estimates  yield  Ts  oc  iV^,  and  Tj  ~  ^  where  tint  is  the  time  required  for 

the  evaluation  of  a  typical  reaction  integral,  because  of  the  complexitiy  of  the  integrals,  Tj  is 
always  important,  and  often  dominant.  Here  we  present  a  scheme  that  allows  a  reduction  of  the 
filling  time  Tj. 

In  the  technique,  briefly  called  Matrix  Synthesis  hereafter,  only  a  selected  fraction  of  the  total 
number  of  required  matrix  entries  is  computed,  the  rest  being  obtained  through  an  approximation 
scheme.  The  scheme  is  based  on  the  consideration  that  the  quantity  of  information  required  for 
the  representation  of  a  field  (degrees  of  freedom,  [1])  is  usually  less  that  the  number  (^  N  /2) 
of  independent  matrix  entries.  This  scheme  has  the  same  objectives  as  those  reported  m  [2]; 
however,  although  it  achieves  a  lower  efficiency,  it  has  the  advantages  of  a  greater  simplicity,  and 
of  being  directly  applicable  to  any  existing  code  (e.g.  NEC). 

Here  we  will  assume  sub-domain  basis  functions  (e.g.  rooftops,  linear  functions  on  triangular 
domains,  etc.)  such  as  those  used  for  describing  genera]  structures,  so  that  the  entire  set  of 
functions  can  be  divided  in  classes  of  spatially  translated/rotated  equal  basis  functions  (e.g. 
triangles  of  the  same  size).  We  further  assume  a  Galerkin  testing  scheme. 


Matrix  Synthesis 

In  the  global  system  (“impedance”)  matrix,  we  consider  here  any  of  the  submatrices  related  to 
any  of  the  subsets  in  which  the  sub-domain  basis  functions  can  be  divided.  This  (sub)matrix 
Z  =  [Zmn]  is  a  mapping  between  the  set  of  integer  pairs  {(m,n),m,n  =  I  :  N}  and  the  complex 
plane  Z.  First,  the  mapping  between  the  indices  (m,n)  and  the  matrix  entry  E-rnn  —  Z{m,n)  is 
regarded  as  a  collection  of  samples  of  the  mapping  between  a  continuous  set  {x,y)  and  Z\  that 
is,  the  discrete  wiables  {m,n)  are  replaced  by  the  continuous  ones  [x,y). 
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The  function  Z{x^y)  can  then  be  approximated  by  fitting  a  suitable  set  of  functions  {/a(^)2/)) 
q;  =  1  :  M}  (e.g.  polynomials)  to  a  small  number  Ns  of  samples  of  the  function  Z{x,y),  each 
sample  being  a  reaction  integral.  The  matrix  Z  is  then  obtained  by  evaluating  the  approximated 
function  Z(x^y)  at  discrete  points  of  the  {x,y)  plane.  The  number  of  required  points  and  their 
location  clearly  depends  on  the  properties  of  the  kernel  of  the  reaction  integrals:  in  particular, 
the  approximation  works  well  only  when  the  dependence  of  Z  on  {x,y)  (that  is,  on  (m,n))  is 
smooth  enough.  The  reaction  integral  between  two  current  distributions  and  J_n  has  the  form 
Zmn  =  Jin)  where  Q  is  the  appropriate  dyadic  Green’s  function  and  stands  for  the 

convolution  integral.  This  integral  depends  on  the  centers  and  of  the  sub-domains,  on 
the  relative  orientation  of  the  sub-domains  and  of  the  currents.  Therefore,  the  global  matrix  Z 
has  to  be  partitioned  into  blocks  that  represent  interactions  between  current  components  that 
vary  smoothly  (or  do  not  vary  at  all)  from  one  node  to  another  (e.g.  azimuthal  or  radial  current 
component  on  a  disk,  the  two  orthogonal  current  components  on  a  rectangular  plate,  etc.). 
Clearly,  matrix  elements  representing  interactions  from  sub-domains  at,  or  close  to,  an  abrupt 
discontinuity  (e.g.  a  right-angle  bend  in  a  microstrip)  are  excluded  from  this  considerations. 

Printed  Structures 

As  an  application  of  this  technique,  we  consider  here  a  plane,  or  plane-stratified  structure,  with 
sub-domain  basis  functions.  Further,  we  consider  the  reaction  Z^n  —  ^mn  between  the  basis 
functions  —  Lm)  Xn[t)  =  W2‘^o(l.”£n)  with  ui,2  being  two  unit  vectors  that 

do  not  depend  on  the  locations  Lm,n^  is  the  chosen  ’’elemental”  basis  function  (e.g.  a 

rooftop).  With  these  assumptions,  the  entries  Zmn  of  the  considered  sub-matrix  are  a  function 
of  the  distance  between  these  two  points  only,  Z-mn  —  ~  ~ 

direct  replacement  of  by  a  continuous  (vector)  variable  g  one  proceeds  then  to 

the  approximation  of  the  function  Z{x,y).  Although  other  forms  of  fitting  could  be  used,  here 
we  will  consider  Least-Squares  (LSQ)  fitting  of  suitable  functions  fa{x^y)  to  the  Ns  samples 
{Zi  =  Z{g.),i=l:Ns}. 

Both  in  free-space  and  in  a  layered  medium,  one  can  extract  from  the  Green’s  function  £  a 
strongly  singular  term  that  behaves  like  1/q^  for  x  — >  0,  and  a  ’’regular”  (integrable)  term 

that  is  bounded  by  1/q.  The  singular  term  can  be  rendered  independent  of  frequency,  is 
purely  real  or  imaginary,  and  represents  the  static  part.  Therefore,  it  is  convenient  (in  general) 
to  separate  Z  in  two  terms  according  to  the  above:  these  will  have  a  rather  different  dependence 
on  the  separation  distance  q. 

The  term  Zs(g)^  due  to  the  static,  singular  part  of  G,  will  be  sharply  peaked  about  g  =  0  and 
decay  abruptly  for  growing  separations  q,  with  a  variation  at  most  equal  to  l/q^.  Note  that  this 
part  is  frequency-independent. 

Conversely,  the  term  Zr{g)  due  to  the  ’’regular”  part  of  Q.  is  smoother  for  a:  =  0,  and  dom¬ 
inated  by  the  oscillating  behaviour  of  Green’s  function;  in  fact,  for  small  enough  sub-domains 
Z{g)  is  approximated  by  the  Green’s  function  itself.  The  spatial  frequency  of  the  oscillations 
can  be  readily  obtained  by  inspection  of  the  appropriate  Green’s  function,  and  this  will  set  the 
spatial  sampling  rate  and  the  number  of  samples  per  wavelength.  On  the  basis  of  the  spectral 
representation  theory  and  contour  deformation,  (e.g.  [3]  )  one  can  ascertain  that  away  from  ^  =  0 
the  dominant  contribution  will  have  a  spatial  period  not  smaller  than  the  shortest  wavelength  in 
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the  layered  medium. 


Applications  and  validation 

This  matrix  synthesis  technique  has  been  applied,  for  the  sake  of  validation,  to  the  following 
different  configurations:  a  microstrip  line,  an  L-shaped  patch,  subdivided  in  rectangular  subdo¬ 
mains,  and  a  narrow  rectangular  patch  with  a  general  mesh  (triangular  based). 

1  On  a  microstrip  line,  only  the  longitudinal  current  component  is  considered,  so  that 
the  matrix  elements  depend  only  on  the  longitudinal  distance  between  centers^  The  line 
has  a  length  L  =  i.27Xj  (\i  is  the  wavelength  in  the  medium,  e,  -  2.33),  and  the  current 
on  it  is  expanded  in  terms  of  105  rooftop  basis  functions:  the  relevant  Z-matrix  is  a,  Toeplitz, 
symmetrical  matrix  and  it  is  only  necessary  to  evaluate  and  approximate  one  row.  Followmg  t  e 
scheme  described  above,  the  “singular”  and  the  “regular”  part  of  the  matrix  are  approximated 
separately.  For  the  singular  part  Z,{g),  whose  variation  is  expected  to  be  between  1/?  and  1/q 
with  a  wide  dynamics,  two  possible  types  of  approximation  have  been  compared,  both  obtained 
by  fitting  Ns  equispaced  samples:  a)  a  direct  approximation  of  Zsig)  by  means  of  a  rational 
function,  and  b)  a  polynomial  approximation  of  log  Zs{q)- 


Figure  1.  Singular  part  of  Z*"*:  polynomial  Figure  2.  Singular  part  of  Z*"*:  log  of  ratio- 

approximation  of  log  Zs  nal  approximation  of  Z, 

Figs.  1  and  2  show  that  both  approximations  work  well;  however,  the  “logarithmic-l-polynomial 
is  easier  and  preferable,  because  the  fitting  of  log  Zs{g)  corresponds  to  a  linear  LSQ  fit  while  the 
rational  fit  relies  on  a  non-linear  optimization.  Moreover,  the  rational  fit  is  less  efficient  in  cover¬ 
ing  the  small  separation  range.  In  both  cases,  the  number  of  samples  could  be  further  on  reduced, 
by  using  a  non-uniform  spacing  sampling. 

The  real  and  immaginary  part  of  the  regular  matrix  Zr  are  approximated  separately,  poly- 
nomially.  The  results,  not  reported  here,  show  that  =  18  samples  are  sufficient  to  correctly 
reproduce  all  the  matrix  entries  within  a  1%  error  bound. 
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2.  The  L-shaped  patch  can  be  seen  as  a  pair  of  ortogonal,  partly  overlapping  microstrips; 
on  each  arm  the  current  can  be  considered  only  longitudinal  (x  or  y  directed),  so  that  in  the 
non-overlapping  zones  only  one  current  component  is  present,  while  where  the  two  strip  are 
superimposed  both  current  components  are  present.  Each  arm  has  a  length  L  —  0.28 and  is 
divided  in  =  25  rooftop  basis  functions.  The  resulting  Z-matrix  can  be  partitioned  in  four 
submatrices:  jivv)  express  the  interaction  between  current  elements  with  the  same 

direction  and  that  are  the  coupling  matrices  between  ortogonal  current  elements. 

The  entries  of  the  first  two  matrices  depend  only  on  the  longitudinal  distance  between  the  basis 
functions,  so  that  the  approximation  can  be  carried  out  in  the  same  way  described  above  for  the 
microstrip  line.  In  the  case  of  the  coupling  submatrices,  the  distance  between  two  subdomains 
depends  either  on  the  longitudinal  or  on  the  transverse  coordinate;  so  the  elements  Z^n  are 
samples  of  a  function  of  both  x  and  y  coordinates,  that  is  suitably  approximated  via  a  two- 
dimensional  fitting.  The  approximation  has  been  applied  separately  to  the  singular  and  to  the 
regular  part  of  the  results  reported  in  Tab.  1  show  a  substantial  reduction  of  the 

number  of  reaction  integral  evalution,  with  an  error  between  the  exact  and  the  synthetized  matrix 
less  than  1%;  it  must  be  observed  that,  while  for  the  regular  part  the  whole  matrix  has  been 
approximated,  in  the  case  of  the  singular  part  the  approximation  concerns  only  the  smoother 
section  of  the  matrix,  i.e.  the  one  not  involving  overlapping  subdomains,  that  is  the  only  portion 
of  the  matrix  in  which  the  approximation  is  meaningful:  aside  from  their  abrupt  variation,  the 
elements  in  the  overlapping  zone  are  ver}^  few. 


Nj 

N 

sing,  part  Z^^^l 
(only  a  section) 

7x7 

(49) 

18  X  18 
(324) 

reg.  part  Z^^^l 
(whole  matrix) 

8x8 

(64) 

25  X  25 
(625) 

Table  1:  Comparison  between  the  number  of  reaction  integrals  evalution  without  (A^)  and  with 
[Nf)  approximation 

3.  The  matrix  synthesis  technique  is  applied  to  a  triangular  mesh  on  which  linear  functions 
are  defined  and  then  suitably  grouped  to  obtain  pyramidal  basis  fuction.  Such  a  mesh  has  the 
advantage  of  being  suitable  for  any  patch  shape,  but  it  has  the  drawback  that  the  elements  of  the 
associated  Z-matrix  are  significantly  more  time-expensive:  it  is  therefore  particularly  important 
to  reduce  the  number  of  entries  to  be  evaluated. 

Here  the  case  of  a  dipole,  where  the  current  is  only  longitudinally  directed,  is  considered: 
the  triangular  subdomains  are  grouped  in  one  row  of  =  20  hexagonal  sub-domains  plus  two 
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Figure  3:  Example  of  grouping  of  the  tranguiar  sub-domains  on  a  rectangular  geometry 

outer  rows  of  njj  pentagonal  sub-domains  (see  Fig.  3);  this  means  that  the  corresponding  matrix 
can  be  partitioned  in  different  submatrices,  expressing  the  interaction  between  two  hexagonal 
pyramids  two  pentagonal  pyramids  or  a  pentagonal  and  a  hexagonal  pyramidal 

basis  function  The  interaction  between  sub-domains  on  the  z-th  and  j-ih  row 

depends  only  on  longitudinal  distance  between  centers  and  the  matrix  elements  can  be  seen 
again  as  samples  of  a  one- variable  function. 


Figure  4.  Singular  part  of  Figure  5.  Real  and  imaginary  part  of  regular 

term  of 

The  results  of  the  approximation  for  the  elements  Zn  —  self- impedance  matrix 

are  shown  in  Figs.  4  and  5  for  the  singular  and  regular  parts,  respectively;  the  solid  line 
shows  the  exact  values  of  Zn,  the  dotted  line  represents  the  polynomial  approximation  obtained 
by  fitting  the  Ns  =  S  samples,  equispaced  except  in  the  final  portion  of  the  curves,  as  emerges 
from  the  plots. 

Figs.  6  and  7  show  the  results  relative  to  the  approximation  of  the  sub-matrix:  also  in 
this  case  8  samples  are  sufficient  for  correctly  synthetize  the  entire  row,  and  consequently  the 
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whole  matrix. 


Figure  6.  Singular  part  of  Figure  7.  Real  and  imaginary  part  of  the 

regular  part  of 

The  results  concerning  the  other  submatrices  are  similar,  and  for  this  reason  are  not  reported 
here:  in  fact  those  shown  in  Figs,  4  to  7  are  sufRcient  to  conclude  that  also  in  this  case  the 
approximation  is  good:  the  differences  between  the  exact  and  approximated  values  are  negligible, 
and  a  reduction  of  a  factor  of  2-3  of  the  matrix  entry  evaluation  time  is  achieved. 

Concluding,  we  can  say  that  this  technique  leads  to  a  drcistical  reduction  of  the  filling  time, 
which  becomes  Tj  oc  ^{rN^)tini  where  the  reduction  factor  r  is  typically  in  the  range  1/2  -i- 1/9. 
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Abstract 

The  Fast  Multipole  Method (FMM)  is  a  technique  for  computing  scattering  and  radiation  from  large  objects 
using  less  memory  and  CPU  time  required  by  dense  techniques.  When  combined  with  high-order  discretizations 
of  the  surface  currents,  the  FMM  can  be  used  to  solve  problems  on  workstations  that  require  supercomputers 
using  dense  techniques.  We  will  present  the  results  of  cross  section  calculations  for  bodies  whose  cross  sections 
have  a  large  dynamic  range.  When  using  low-order  basis  functions,  the  FMM  requires  less  memory  and  cpu  time 
than  dense  techniques  for  objects  above  a  certain  size.  When  high-order  basis  functions  are  used,  the  crossover 
point  occurs  for  larger  objects.  However,  for  bodies  larger  than  this  threshold,  the  FMM  and  high-order 
discretizations  combine  to  permit  solutions  with  lower  memory  and  cpu  requirements  than  either  enhancement 
used  alone. 


1  Introduction 

The  Fast  Multiple  Method(FMM)[l]  can  reduce  the  asymptotic  complexity  of  iterative  scattering  cross  section 
computations  by  a  factor  oi  O  [N/ \og[N))  compared  to  that  required  by  dense  matrix  techniques.  When  used 
with  a  Galerkin  discretization  of  the  scattering  integral  equations,  it  depends  on  the  basis  functions  being  localized 
on  the  wavelength  scale.  Use  of  high-order  basis  functions  with  relatively  broad  support  permits  more  efficient  and 
accurate  cross  section  calculation  than  using  low-order  basis  functions[2] . 

When  both  of  these  techniques  are  used  together,  the  optimal  basis  function  order  and  support  size  for  dense 
techniques  is  not  the  optimal  one  for  the  FMM.  For  relatively  small  problems,  it  is  more  efficient  to  use  dense 
techniques  than  the  FMM.  Even  when  the  FMM  is  more  efficient  than  dense  techniques  that  use  pulse  or  rooftop 
basis  functions,  it  is  sometimes  better  to  use  dense  techniques  with  high-order  basis  functions  having  broad  support 
than  to  use  the  FMM  with  basis  functions  which  are  more  localized. 

We  will  illustrate  the  interplay  of  the  FMM  and  high-order  basis  functions  in  optimizations  of  computations  for 
boomerang-  and  bat-shaped  objects  of  various  sizes.  When  the  size  of  these  objects  increases  beyond  one  hundred 
wavelengths,  the  cross  section  has  a  large  dynamic  range  (more  than  one  million  to  one  for  a  boomerang  with  600A 
long  arms,  for  instance).  Accurate  computation  of  the  cross  section  requires  careful  numerical  integrations  and 
high-order  discretizations. 

We  solved  these  scattering  problems  by  discretizing  the  2d  electric  field  integral  equation  using  the  Galerkin 
technique.  Our  basis  functions  consist  of  localized  polynomials  in  the  parametric  surface  coordinate.  We  used  the 
single  stage  FMM  with  a  theoretical  asymptotic  complexity  of  O  where  N  is  the  number  of  unknowns  in 

the  problem.  The  data  for  memory  usage  and  CPU  time  per  iteration  follow  this  theoretical  prediction  remarkably 
well. 

W^e  will  present  our  technique  for  reliably  determining  the  accuracy  for  our  solutions  to  these  problems.  Given 
this  accuracy  measure,  we  will  show  how  CPU  time  and  memory  for  a  given  accuracy  can  be  optimized  as  a 
function  of  quadrature  order,  basis  function  order,  FMM  group  diameter,  and  patch  size. 

tXhis  research  was  supported  in  part  by  the  Advanced  Research  Projects  Agency  of  the  Department  of  Defense  and  was  monitored 
by  the  Air  Force  Office  of  Scientific  Research  under  Contract  Number  F49620-91-C-0064.  The  United  States  Government  is  authorized 
to  reproduce  and  distribute  reprints  for  governmental  purposes  notwithstanding  any  copyright  notation  hereron. 
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2  Background 

In  this  section,  I  will  provide  a  brief  summary  of  the  FMM  and  the  high-order  techniques  employed  in  these 
computations. 


2.1  General  Background 

We  are  interested  in  solving  the  scalar  Helmholtz  equation 

subject  to  the  Dirichlet  boundary  condition  ^  =  0  on  the  scatterer  surface  S  and  the  radiation  condition  at  oo. 
Let  the  field  rj;  consist  of  an  incident  plane  wave  0(x)  and  a  scattered  wave  cp  having  the  asymptotic  form 


The  differential  cross  section  is  given  by 


<P  =  /(^)— 7=-  ;  r  — >■  oo  . 

y/r 


Eq.  (1)  can  be  transformed  into  the  integral  equation 

=  ~  J  dx' G{x,x')a{x')  ]  xonS  ,  (4) 

where 

=  -  ^Inn^  n  •  V^(x)  ,  (5) 

n  being  the  unit  normal  pointing  outward  from  5. 

Eq.  (4)  was  discretized  for  numerical  solution  by  the  Galerkin  technique.  One  expands  er  in  terms  of  a  set  of 
basis  functions  {6,-} 

=  ,  (6) 

obtaining  the  matrix  equation 


^ij  =  -i  J^dxbj{x)  J  dx' G{x,x*)bi{x)  . 


Efficient  numerical  solutions  of  Eq.  (4)  depend  on  the  appropriate  choice  of  basis  functions.  We  have  found 
that  orthogonal  polynomials  in  a  parametric  surface  coordinate  provide  a  choice  that  is  efficient  to  compute  and 
provides  rapid  solution  convergence [2].  For  maximum  efficiency,  one  must  use  accurate  representations  of  the 
scattering  surface[3].  The  basis  functions  are  defined  to  be  non-zero  only  on  localized  regions  of  the  surface  we 
call  “patches”.  ’ 


2.2  Fast  Multipole  Method 

The  FMM  is  a  sparse  factorization  of  the  impedance  matrix  Z  defined  in  Eq.  (9).  In  its  simplest  form,  the  matrix 
2  can  be  expressed  to  arbitrary  precision  as  a  combination  of  three  sparse  operators:  a  “near”  interaction  term 
,  a  far  field  transformation  V  and  a  translation  operator  T, 

z^z^^v^TV.  rim 
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Figure  2:  Geometry  of  50 A  boomerang 

The  FMM  operator  is  constructed  by  arranging  the  basis  functions  into  groups  where  the  basis  functions  in  each 
group  have  supports  that  are  near  each  other.  The  diameter  of  a  group  D  is  the  diameter  of  the  smallest  circle 
enclosing  the  supports  of  all  basis  functions  in  the  group.  The  optimal  group  size  occurs  when  the  number  of 
groups  is  approximately  equal  to  the  square  root  of  the  number  of  unknowns. 

Clearly,  the  group  diameter  D  can  be  no  smaller  than  the  largest  support  of  the  basis  functions  in  the  group. 
Sometimes  one  must  trade-off  the  optimum  group  size  for  the  most  efficient  FMM  factorization  against  the  patch 
size  that  takes  best  advantage  of  high  order  basis  functions. 


3  Geometries 

We  used  two  geometries  for  this  study,  the  boomerang  and  bat  shown  in  Figure  1.  oriented  so  that  0°  is  nose-on. 
We  will  characterize  the  size  of  each  object  by  the  distance  L  from  the  nose  to  the  tip  of  the  leading  edge.  The 
variously  sized  objects  are  obtained  by  increasing  the  length  L  and  keeping  the  radius  of  curvature  of  the  corners 
constant.  For  example,  larger  geometries  are  shown  in  Figure  ??. 

These  geometries  were  chosen  to  be  geometrically  simple,  but  challenge  the  numerical  capabilities  of  our  com¬ 
puter  program  because  of  their  large  dynamic  range  of  cross  sections.  Nose  on,  the  cross  section  of  both  ob¬ 
jects  is  approximately  that  of  a  circle  with  the  same  radius  as  the  radius  of  curvature  of  the  nose  (r  =  0.625A, 
daldB  =  0.3217A).  On  the  other  hand,  the  tails  look  like  one  or  two  corner  reflectors.  The  cross  sections  for  waves 
incident  on  the  tail  is  approximately  equal  to  the  square  of  the  span  of  the  objects.  For  an  object  with  a  length  of 
1500A  the  ratio  of  monostatic  cross  sections  from  the  tail  and  nose  of  the  objects  is  1.4  x  10"^. 
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Figure  3:  Cross  section  and  relative  error  as  a  function  of  angle  for  a  1500A  bat. 

4  Error  Analysis 

Meaningful  comparisons  of  various  techniques  for  computing  cross  sections  require  accurate  error  estimates.  Our  use 
of  high-order  methods  makes  reliable  estimates  fairly  inexpensive  to  compute.  Given  a  solution  with  a  particular 
estimated  accuracy,  we  can  compute  another  solution  with  an  order  of  magnitude  smaller  error  with  a  modest 
increase  in  computation  time  and  memory.  We  estimate  the  error  in  the  second  solution  to  be  the  difference 
between  the  two  solutions. 

We  have  verified  the  postulated  high-order  solution  convergence  by  following  the  above  procedure  with  scattering 
from  circles,  where  an  independent  series  solution  is  available.  We  have  also  found  that  the  solutions  with  increasing 
order  converge  rapidly  to  the  assumed  exact  solutions  for  the  problems  reported  here. 

As  mentioned  in  the  introduction,  these  targets  were  chosen  to  have  a  large  dynamic  range  in  their  cross 
sections.  For  the  largest  target,  a  1500A  bat,  there  is  a  dramatic  difference  between  the  relative  errors  in  regions 
of  low  and  high  cross  sections  as  shown  in  Figure  3.  Because  of  this,  denser  discretizations  and  more  accurate 
quadratures  were  required  for  the  larger  scatterers  than  for  the  smaller  ones. 

5  Results 

In  order  to  illustrate  the  separate  and  combined  effects  of  the  FMM  and  high-order  techniques,  we  will  compare 
CPU  and  memory  requirements  for  four  situations: 

•  low-order  dense  solutions, 

•  high-order  dense  solutions, 

•  low-order  FMM  solutions,  and 

•  high-order  FMM  solutions. 

The  CPU  time  reported  is  the  total  time  to  solve  the  problem.  In  the  dense  case  (using  a  direct  sovler),  it  is 
the  sum  of  the  time  to  fill  the  impedance  matrix,  the  time  to  do  an  LU  decomposition  of  the  matrix,  and  the  time 
to  solve  Eq.  (7)  for  multiple  right-hand  sides.  Let  N  be  the  number  of  unknowns  in  the  problem.  The  fill  time  is 
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Figure  4:  CPU  time  required  for  dense  and  FMM  computation  of  the  cross  section  of  a  boomerang.  Low-order 
basis  functions  were  used  in  both  cases. 


proportional  to  and  dominates  for  small  problems.  For  a  given  N  it  is  usually  faster  to  compute  the  matrix 
elements  using  high-order  basis  functions  than  low-order  ones.  The  LU  decomposition  time  is  proportional  to  iV 
and  dominates  for  large  problems.  The  solution  time  of  Eq.  (7)  is  proportional  to  but  has  a  smaller  coefficient 
than  the  fill  time  (unless  the  far  field  is  grossly  oversampled).  .  r  , 

In  the  case  of  the  FMM,  there  are  two  significant  components  of  the  CPU  time:  the  computation  of  the  FMM 
operator,  and  the  iterative  solution  of  the  matrix  equation  at  each  angle.  The  FMM  setup  time  has  an  asymptotic 
complexity  of  O  In  small  to  medium  problems  where  a  limited  number  of  scattering  angles  are  computed, 

this  time  dominates.  For  a  given  N  and  FMM  group  diameter  D,  it  is  usually  faster  to  compute  the  FMM  operator 
using  high-order  basis  functions  than  low-order  ones.  For  the  largest  problems,  the  CPU  time  is  dominated  by 
the  FMM  solution  time.  Each  iteration  in  the  FMM  solution  has  an  asymptotic  complexity  of  O  (N  '  ).  The 
complexity  of  the  iterative  solver  we  used  is  an  open  question.  We  use  a  simple  block-diagonal  preconditioner,  the 
biconjugate  gradient  iterative  solver,  and  intelligent  starting  guesses  for  the  surface  currents.  This  combination 
appears  to  have  a  complexity  no  higher  than  O 

When  the  desired  accuracy  could  be  achieved  in  single  precision,  the  impedance  matrix  and  FMM  operator 
were  stored  that  way.  This  was  possible  on  all  but  the  largest  problems.  In  addition,  the  matrices  Z  and  Z  use 
packed  symmetric  storage  reducing  the  required  memory  by  a  factor  of  2.  Although  similar  packed  storage  of  the 
T  component  of  the  FMM  operator  is  possible,  it  has  not  been  implemented. 

In  the  following  subsections  all  times  are  CPU  seconds  on  a  Sun  Sparc  10.  Memory  is  measured  in  kilobytes. 


5.1  Boomerang 

Figure  4  shows  a  comparison  of  the  CPU  time  required  for  low-order  basis  functions  and  a  dense  direct  solver  to 
the  time  required  using  the  same  basis  functions  with  the  FMM.  The  computations  were  done  with  an  rms  relative 
error  of  0.1.  Using  these  low-order  basis  functions,  the  FMM  provides  a  savings  in  CPU  time  for  boomerangs  longer 
than  25A.  Figure  5  is  a  similar  comparison  of  the  memory  required  in  these  two  cases.  The  memory  required  in 
the  dense  case  is  proportional  to  the  length  to  the  power  2.2.  It  increases  a  little  faster  than  the  square  of  the 
length  because  of  the  need  to  use  a  finer  discretization  as  the  length  increases  in  order  to  keep  the  error  constant. 
The  FMM  memorv  is  proportional  to  the  length  to  the  power  1.6. 

Figure  6  is  a  comparison  of  the  CPU  time  required  for  low-order  basis  functions  and  high-order  basis  functions, 
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length,  lambda 

Figure  5:  Memory  required  for  dense  and  FMM  computation  of  the  cross  section  of  a  boomerang.  Lower  order 
basis  functions  were  used  in  both  cases. 

using  the  dense  direct  solver  in  both  cases.  The  computations  were  done  with  an  rms  relative  error  of  0.01.  Use 
of  high-order  basis  functions  dramatically  reduces  the  CPU  time  required,  but  does  not  change  the  asymptotic 
complexity  of  the  solution. 

Figure  7  shows  the  combined  effect  of  using  high-order  basis  functions  and  the  FMM,  comparing  the  CPU  time 
required  for  low-order  basis  functions  with  a  dense  matrix  and  high-order  basis  functions  with  the  FMM.  The 
computations  were  done  with  two  significant  digits  of  accuracy.  The  crossover  from  where  the  FMM  is  faster  than 
a  low-order  dense  solver  occurs  at  a  shorter  length  of  10 A  than  when  the  FMM  is  used  alone. 

5.2  Bat 

Scattering  from  the  bat  produces  results  that  are  very  similar  to  the  ones  shown  in  the  previous  section.  As  an 
illustration,  Figure  8  compares  the  memory  required  to  compute  scattering  from  a  bat  using  a  dense  matrix  and 
using  the  FMM.  High-order  basis  functions  are  used  in  both  cases.  Note  that  the  crossover  occurs  for  a  longer 
length  of  64 A  than  in  Figure  5  where  low-order  basis  functions  are  used.  The  most  efficient  discretization  of  the 
smaller  bats  uses  patches  that  are  so  large,  they  force  the  FMM  group  diameters  to  be  larger  than  is  necessary 
(and  optimal)  in  the  low-order  case.  The  slope  of  the  two  curves  are  2.0  and  1.5  as  expected. 

6  Conclusion 

Use  of  the  FMM  in  conjunction  with  high-order  discretizations  dramatically  increases  the  raaximumsize  of  problems 
which  can  practically  be  solved  on  a  given  computer  to  a  given  accuracy.  The  use  of  high-order  methods  and  the 
FMM  becomes  especially  important  for  problems  with  a  large  dynamic  range  of  cross  sections  such  as  the  bat 
and  boomerang  examples  used  here.  In  these  problems,  denser  discretizations  are  required  than  in  problems  with 
smaller  dynamic  ranges.  Use  of  high-order  methods  decreases  the  number  of  unknowns  required  to  obtain  sufficient 
accuracy.  Use  of  the  FMM  reduces  the  time  and  memory  required  to  solve  the  resulting  large  systems  of  linear 
equations.  Using  both  techniques  provides  more  improvement  than  is  possible  using  either  technique  separately. 
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Figure  6:  CPU  time  required  for  computing  the  cross  section  of  a  boomerang  with  low-order  and  high-order  basis 
functions 


Boomerang 


length,  lambda 


Figure  7:  CPU  time  required  for  computing  the  cross  section  of  a  boomerang  with  low-order  basis  functions  and 
a  dense  matrix  with  the  time  required  using  high-order  basis  functions  and  the  FMM 


612 


Bat  Memory 


1E+7 

lE+6 

lE+5 

Jg  IE+4 

o 


lE+3 

lE+2 

lE+1 

lE+0 


^ ^ - 

1  10  100  1000 
length,  lambda 


dense 

FMM 


10000 


Figure  8:  CPU  time  required  to  compute  the  cross  section  of  a  bat  using  high-order  basis  functions. 
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Abstract 


A  multilevel  algorithm  is  presented  for  analyzing  scattering  from  electrically  large  conducting  and/or 
penetrable  objects  The  algorithm  accelerates  the  iterative  solution  of  integral  equations  that  arise  in 
comnutationa/electromat^netics.  The  algorithm  permits  a  fast  matrix-vector  multiply  by  decornposmg  the 
traditional  Method  of  Moment  matrix  into  a  large  number  of  blocks,  each  describing  the  interaction 
between  distant  scatterers.  The  multiplication  of  each  block  with  a  tnd  solution  vector  is  executed  u^ng 
a  multilevel  scheme  that  resembles  an  FFT  and  that  only  relies  on 

computational  complexity  and  the  memory  requirements  of  the  proposed  algorithm  are 
The^gorithm  is  applicable  to  the  analysis  of  scattering  from  two-  and  three-dimensional  structures,  to 
date  the  algorithm  has  been  successfully  applied  to  the  analysis  of  scattering  from  large,  two- 
dimensional,  perfectly  conducting  and  dielectric  scatterers. 

1.  Introduction. 

Recently,  the  electromagnetics  community  has  shown  a  renewed  interest  in  iterative  solvers  as  a 
means  for  Living  very  large  scattering  problems.  A  number  of  techniques  has  been  proposed  that 
permit  a  fast  evaluation  of  a  matrix-vector  multiplication  for  matrices  that  arise  in  the  Method  of 
?MoMi  based  solution  of  such  problems.  Therefore,  these  techniques  may  drastically  reduce  the 
Lm^tational  cost  associated  with  the  iterative  solution  of  large  scattering 

but  Le  not  limited  to,  the  Impedance  Matrix  Localization  (IML)  technique  [IL  the  Complex  Multipole 
Beam  approach  (CMBA)  [2],  the  Fast  Multipole  Method  (FMM)  [3],  and  the  Adaptive  Integral  Method 
(AIM)  [4?.  Both  the  IML  and  the  CMBA  rely  on  forming  directive  radiation  beams 
bas^  and  testing  functions  and  are  limited  in  scope  to  scattering  by  sufficiently  smooth  bodies.  TOe 
FMM  and  AIM  rely  on  a  multilevel  geometrical  partitioning  of  the  scatterer  and  an  expansion  of  the  field 
in  terms  of  spherical  harmonics,  and  are  apphcable  to  arbitrarily  shaped  scatterers. 

It  is  well-known  that  a  conventional  MoM  matrix  contains  all  the  information  needed  to  solve  a 
scattering  problem.  This  served  as  our  motivation  for  seeking  methods  that:  (i)  are  apphcable  to 
scatterers  ^(ii)  rival  the  FMM  and  the  AIM  in  efficiency,  and  (iii)  make  direct  use  of  the  MoM 
elements  in  conjunction  with  commonly  available  algebraic  techniques.  Based  upon  thep  requirements 
weSoped  Multilevel  Matrix  Decomposition  Algorithm  (MLMDA)  [5,  6],  that  exploits  the  limited 
number  ofD&^KCS  of  Freedom  (DoF)  that  characterize  a  field  observed  oyer  a  domain  that  is  well- 
separated'  from  a  source  domain.  In  the  MLMDA,  a  matrix-vector  multiplication  is  performed  by  (i) 
subdividino-  the  overall  MoM  matrix  into  a  large  number  of  submatnces,  (ii)  performing  the  required 
multiplicatfon  on  each  of  these  submatrices  separately,  and  (iii)  assembling  the  resulting  sub-vectors  into 
the  SOTght  after  vector.  Both  the  memory  requirements  and  the  computational  complexity  of  the 
algorithm  are  o{ N\or^  n),  where  N  is  the  number  of  unknowns  describing  the  current  on  the  scatterer 
The  M^MDA  evaluates  the  multiplication  of  the  above  mentioned  submatnces  with  a  tnal  solution  vector 
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using  an  algorithm  that  resembles  an  FFT.  The  MLMDA  can  easily  be  incorporated  into  existing  Method 
of  Moment  (MoM)  programs. 

This  paper  is  organized  as  follows.  Section  2  briefly  outlines  the  proposed  multilevel  scheme. 
Sections  3  discusses  the  computational  complexity  of  the  proposed  algorithm  and  compares  it  to  other 
fast  algorithms,  notably  the  FMM  and  the  AM.  Sections  4  and  5  describe  numerical  results  and  outline 
conclusions  and  suggestions  for  future  work,  respectively. 

2.  Formulation  of  the  Multilevel  Scheme. 

This  section  outlines  the  proposed  multilevel  solver  for  a  two’dimensional  TMz  scattering  problem. 
The  algorithm  can  be  generalized  to  solve  two-dimensional  TEz  and  three-dimensional  scattering 
problems  with  minor  effort.  The  proposed  multilevel  algorithm  relies  on  a  multilevel  geometrical 
decomposition  of  the  scatterer  and  the  corresponding  MoM  matrix  and  a  multilevel  multiplication 
algorithm  for  accelerating  the  matrix- vector  product  of  the  decomposed  MoM  matrix  with  a  trial  solution 
vector. 

The  multilevel  decomposition  of  the  scatterer  is  performed  by  enclosing  the  scatterer  in  a  large 
(typically  square)  box  and  by  recursively  subdividing  this  box  into  smaller  boxes.  Starting  from  the 
coarsest  box-discretization,  all  pairs  of  non-empty  boxes  that  are  geometrically  non-adjacent  to  one 
another  and  whose  parent  boxes  are  touching  one  another,  are  identified  at  every  level  (Fig.  1).  Each 
identified  pair  of  boxes  corresponds  to  a  submatrix  of  the  global  MoM  matrix  (Fig.  2),  and  the 
multiplication  of  each  of  these  submatrices  with  a  trial  solution  vector  is  performed  using  a  multilevel 
scheme  outlined  below.  The  multiplication  of  those  portions  of  the  MoM  matrix  that  are  not  embedded 
in  any  of  the  identified  submatrices,  with  a  trial  solution  vector,  is  performed  using  direct  multiplication. 

Upon  decomposing  the  MoM  matrix  into  submatrices,  the  multiplication  of  a  given  submatrix  with  a 
trial  solution  vector  is  performed  independently  from  that  of  all  other  submatrices  using  the  three-stage 
algorithm  outlined  below  and  illustrated  in  Fig.  3. 

During  the  first  stage  of  the  multiplication  algorithm,  each  source  in  the  source  block  is  identified  and 
replaced  by  equivalent  sources  that  reside  on  the  surface  of  the  finest  level  box  that  encloses  the  source. 
These  equivalent  sources  do  not  regenerate  the  fields  radiated  by  the  original  sources  everywhere  in 
space.  Rather,  the  excitation  coefficients  of  these  equivalent  sources  are  chosen  such  that  the  field 
radiated  by  the  equivalent  sources  equals  that  radiated  by  the  original  sources  over  the  observation  block. 
The  number  of  equivalent  sources  is  equal  to  the  number  of  DoF,  that  characterize  the  field  in  the 
observation  block  due  to  the  actual  sources  that  reside  in  each  of  the  finest  level  blocks  contained  in  the 
source  block.  It  can  be  shown  that  [7,  8]: 

OC  ^source^observation  ^  In^- j  (1) 

where  ^observation  circles  circumscribing  the  finest  level  source  blocks  and 

the  observation  block,  r  is  the  distance  between  the  blocks,  and  £  the  tolerated  error  in  the  field  that  is 
regenerated  by  the  equivalent  sources  over  the  observation  region.  Upon  estimating  equivalent 
sources,  in  number,  are  placed  on  the  surface  of  each  finest  level  non-empty  box  contained  in  the 
source  box,  and  their  excitation  coefficients,  4^,  are  computed  from  those  of  the  original  sources,  4^, 
using: 


le,={ZlfZ2lso 


(2) 
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where  Z,  is  the  MoM  matrix  describing  the  field  due  to  the  equiydent  sources  at  a  simlarly  selected  set 
of  N.  observation  points  on  the  observation  box.  Z,  is  the  MoM  matnx  descrying  the  field  due  to  the 
actuafsources  at  these  same  observation  points,  and  t  denotes  a  Moore-Penrose  pseudo  inverse. 
At  the  end  of  the  first  stage,  all4ources  in  the  source  box  corresponding  to  the  submatrix  have  been 
mplacerby  eqdvaJent  soufc^s  residing  on  the  surface  of  the  finest  level  box  that  contains  the  source. 

During  the  second  stage  of  the  execution  of  the  multiplication  algorithm  the  finest  level  source 
blocte  are^recursively  combined,  while  the  observation  block  is  recursively  subdivided,  until  the  origmid 
source  block  has  beL  recovered  and  the  finest  level  observation  blocks  contained  m  the  observation 
block  are  reached.  In  doing  so,  the  product  of  the  number  of  non-empty  source  and  observation  regions 
considered  at  each  level  remains  approximately  constant.  For  each  pair  of  non-empty  source  and 
obseivation  regions  considered  at  each  level,  a  set  of  coefficients  is  computed  which  when 
associated  with  the  ,  sources  on  the  surface  of  the  source  box  being  considered,  regenerate  the  field 
Sh™e“  ation  bL  being  considered.  The  number  of  DoF  of  the  fields  considered  at  each  level 
remains  constant:  as  the  size  of  the  source  region  increases,  the  size  of  the  observation  regions  being 
considered  shrinks,  and,  by  virtue  of  equation  (1),  remains  constant.  The  excitation  coefficien 
for  eLch  pair  of  source  and  observation  regions  at  a  giveii  level  are  computed  using  equa  ion  (2)  where 
the  role  of  the  actual  sources  is  replaced  by  that  of  the  equivalent  sources  considered  at  the  prej»ous 
level  At  the  end  of  the  second  stage,  a  large  number  of  equivalent  source  excitation  coeffiaents 
has  been  computed,  each  of  which,  when  associated  with  the  excitation  coefficients  on  the  source 
box,  regenerates  the  original  field  in  a  different  finest  level  observation  box. 

During  the  third  stage  of  the  multiplication  algorithm,  each  of  these  sets  of  excitation  coefficients  is 
used  to  regenerate  the  fields  due  to  all  original  sources  contained  m  the  source  box  at  the  observation 
points  in  the  corresponding  non-empty  finest  level  box  in  the  observation  block. 

The  execution  of  the  above  three  stages  results  in  the  butterfly-like  submatnx- vector  multiplication 
scheme  that  exhibits  a  resemblance  to  the  well-known  FFT  algorithm. 

3.  Computational  Complexity  and  Comparison  to  Other  Techniques. 

The  computational  complexity  of  the  MLMDA  is  O(Alog^A).  Let  Q  denote  the  cost  of  muMp^^^^^^^^ 

the  submatrices  describing  coarsest  level  interactions.  It  is  easily  venfied  that  C,  Alog  A .  The  cost 
c!  of  Zltiplying  those  blocks  describing  interactions  at  the  next  to  coarsest  level  is  proportional  to  C,^ 
althouc^h  the  cost  of  multiplying  a  single  submatrix  with  a  trial  solution  vector  is  only  halve  of  that  at  the 
fo^selUevel  there  are  mice  as  mlny  non-empty  blocks  at  the  next  to  coarsest  level,  and.  hence 
C  C,  A  similar  reasoning  holds  tme  for  all  subsequent  levels,  and  since  there  are  og  eve  s 
o^rall  cost  of  the  algorithm  becomes  O  Alog^  A).  It  is  easily  verified  that  the  meinory  require^ments  of 

the  algorithm  are  also  O(Alog^A).  The  computational  cost  associated  with  the  evduation  of  LeW^ue 
to  ncM  neighbors,  and  the  storage  requirements  associated  with  this  operation,  are  both  0(A)  and  hence 
subdomin^t  to  computational  and  storage  costs  associated  with  the  evaluation  of  the  field  due  to  distant 

sources. 

The  MLMDA  exhibits  similarities  with  the  FMM,  especially  with  the 
(RPFMA)  developed  by  Wagner  and  Chew  [9].  When  compared  to  the  FMM  and  RPFMA.  the 
MLMDA  augments  a  traditional  MoM  code  solely  by  well-known  algebraic  techniques.  Hence, 
MLMDA  is  straightforward  to  implement  and  the  differences  between  two-  and  three-dimensiOTa 
implementations  of  the  MLMDA  are  minimal.  The  computational  complexity  of  the  nmltilevel 
(MLFMM)  is  OfAlog^  A)  but  can  decrease  to  O(AlogA)  for  very  lar^  scatterers  [10].  In  con  . 

co^tiion^^compWxity  of  the  MLMDA  cannot  decrease  below  Jhis  is  a  forect 

conseauLce  of  the  non-recyclable  nature  of  the  MLMDA  when  compared  to  MLFMM.  The  MLMDA 
a°so  Sus  sinSarities  with  the  AIM  [4].  Like  the  AIM.  the  MLMDA  represents  distant  fields  due  to 
Surces  raiding  on  the  scatterer  in  terms  of  those  due  to  sources  residing  on  a  regular  grid.  However, 
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in  contrast  to  the  AIM,  which  exhibits  a  computational  complexity  of  log  N")  for  two-dimensional 

surface  scatterers,  the  MLMDA  does  not  perform  an  FFT  on  the  entire  grid,  which  results  in  a  more 
favorable  computational  complexity  for  the  MLMDA. 

4.  Numerical  Results. 

This  section  summarizes  sample  results  obtained  using  an  implementation  of  the  above  described 
algorithm  for  the  analysis  of  TMz  scattering  from  perfectly  conducting  and  coated  strip-like  structures. 
Numerical  results  obtained  using  the  MLMDA  for  the  TEz  polarization  will  be  described  in  an 
forthcoming  paper. 

Consider  the  conducting  L-shaped  profile  and  its  coated  version  shown  in  Figures  4a  and  4b.  Each 
side  of  the  profile  measures  /  the  total  length  of  the  profile  therefore  is  2  /  Xq.  The  coating  consists 
of  a  perfect  dielectric  of  permittivity  er=4  and  thickness  r=0.05  Xq-  The  pec  strip  was  modeled  using  10 
unknowns  per  wavelength,  and  the  polarization  current  in  the  coating  was  modeled  using  a  single  layer 
of  volumetric  cells.  The  structure  is  illuminated  by  a  TMz  plane  wave  as  shown  in  Fig.  4.  The  CPU¬ 
time  requirements  for  solving  this  scattering  problem  using  the  MLMDA  are  shown  in  Figure  5  as  a 
function  of  N.  The  maximum  number  of  unknowns  used  to  solve  this  scattering  problem  was 
16,384,  however,  we  have  analyzed  structures  with  up  to  50,000  unknowns  using  the  MLMDA. 
The  graph  also  indicate  the  number  of  levels  used  in  the  analysis,  the  maximum  being  10.  All  analysis 
were  performed  on  a  SUN  SPARC  10  Workstation.  The  conduction  and  polarization  currents  (not 
shown)  are  in  perfect  agreement  with  those  obtained  using  a  standard  MoM  program  up  to  A^=4192, 
beyond  this  point  the  direct  solver  becomes  impractical  on  the  workstation.  Analysis  of  the  data  reveals 
that  the  memory  requirements  and  the  CPU-time  closely  tracks  the  theoretical  0(A^Iog^A^j  trend. 
Although  no  formal  analysis  of  error  propagation  of  the  algorithm  has  been  developed,  the  algonthm  has 
proven  to  be  stable  for  all  investigated  cases. 

5.  Conclusions. 

A  multilevel  matrix  decomposition  algorithm  was  presented  for  analyzing  scattering  from  electrically 
large,  conducting  and/or  penetrable,  objects.  The  algorithm  permits  a  fast  matrix- vector  multiply  by 
decomposing  the  traditional  MoM  matrix  into  a  large  number  of  blocks,  each  of  which  describes  the 
interaction  between  distant  scatterers.  The  multiplication  of  each  block  with  a  trial  solution  vector  is 
executed  using  a  multilevel  scheme  that  resembles  an  FFT  and  that  only  relies  on  well-known  algebraic 
techniques.  Fields  due  to  nearby  sources  are  evaluated  directly.  The  computational  complexity  and  the 
memory  requirements  of  the  proposed  algorithm  are  0(A^log^  A/).  The  algorithm  is  applicable  to  the 
scattering  analysis  of  both  two-  and  three-dimensional  structures.  Numerical  results  obtained  for  two- 
dimensional  scatterers  have  demonstrated  the  usefulness  of  the  proposed  approach.  A  three-dimensional 
version  of  the  algorithm  is  currently  being  implemented  and  applied  to  the  analysis  of  scattering  from 
large  free-standing  structures  and  structures  embedded  in  a  multilayered  medium. 
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scatterer 


examples  of  level  2  and  level  1 
boxes  whose  corresponding  MoM 
interaction  matrices  is  computed  using  the 
Multilevel  algorithm 


example  of  two  level  zero  boxes 
whose  interaction  is  computed  directly 


Fig.  1.  Strip-like  stnjcture  in  a  recursively  subdivided  box,  and  examples  of  boxes  whose  interactions 
are  computed  using  the  multilevel  algorithm. 
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Fig.  2.  Example  of  a  decomposed  matrix  for  a  flat  strip  structure. 
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Fig.  3.  Multiplication  of  a  submatrix  describing  the  interaction  between  two  distant  subsections  of  the 
scatterer  using  MLMDA.  During  stage  1,  the  actual  sources  are  replaced  by  equivalent  sources  on  a 
regular  grid,  during  stage  2,  a  multilevel  scheme  replaces  these  sources  by  equivalent  sources  on  a 
coarser  grid  while  shrinking  the  corresponding  observation  domain,  and  during  stage  3  the  fields  due  to 
each  set  of  equivalent  sources  is  computed  at  the  respective  observers. 
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(a)  (b) 

Fig.  4.  Uncoted  and  coated  L-shaped  profiles. 


Fig.  5.  CPU-time  per  iteration  for  MLMDA  for  the  pec  strip  and  coated  strip  shown  in  Figs  4a  and  4b, 

respectively. 
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1  Introduction 

The  fast  multipole  method  is  no  longer  new,  and  several  implementations  have  been  discussed  in  the 
literature  for  accelerating  method  of  moments  calculations.  See  for  example  [1],  [2],  and  [3]  for  2-D 
implementations,  and  [4],  and  [5]  for  3-D  implementations.  In  [3]  a  2-D  multilevel  implementation 
is  discussed,  the  other  articles  discuss  one  level  implementations. 

The  general  principals  are  now  quite  well  understood.  The  addition  theorem  for  spherical  Bessel 
functions  provides  a  mechanism  for  translating  the  origin  of  multipole  expansions.  Translation  allows 
us  to  take  several  multipole  expansions  and  translate  them  to  a  common  reference  frame  and  then  to 
aggregate  them  into  a  single  expansion.  It  is  the  aggregation  of  expansions  that  ultimately  provides 
the  speedup. 

The  objective  of  this  paper  is  to  address  the  difficulties  that  arise  when  a  multilevel  implementation 
is  attempted. 


2  One  Level  FMM 


When  FMM  is  implemented  with  one  level  the  N  unknown  sources  are  divided  into  approximately 
clusters  with  approximately  sources  in  each  cluster.  The  outer  multipole  expansion  for  each 
cluster  will  have  coefficients.  The  complexity  of  computing  an  outer  multipole  expansion 

for  each  source  centered  at  its  cluster  center  is  making  the  complexity  of  computing  all 

outer  expansions 

After  the  outer  expansions  have  been  computed  for  each  cluster,  inner  expansions  are  computed  as 
shown  in  figure  1.  The  complexity  of  computing  an  inner  expansion  centered  at  a  remote  cluster, 
when  both  outer  expansion  and  inner  expansion  have  coefficients  is  0(N).  Since  0(N) 

of  these  translations  are  required  this  results  in  a  complexity  of  O(N^)  for  the  translation  process. 
This  is  too  high  for  a  fast  method. 

The  solution  to  this  problem  was  provided  by  Rokhlin  in  [6].  Rokhlin  showed  that  one  could  use 
the  radiation  function  instead  of  the  multipole  expansion,  and  that  this  reduced  the  complexity  of  the 
translation  to  making  the  complexity  of  the  entire  translation  process 
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Figure  1  One  Level  FMM 


3  The  Radiation  Function 

The  concept  of  the  radiation  function  is  not  new.  It  is  well  known  that  the  radiation  function  of 
a  scattered  field  uniquely  defines  the  field  [7],  but  the  use  of  the  radiation  function  to  accelerate 

calculations  is  novel. 

If  the  scattered  field  is  defined  by  an  outer  multipole  expansion 

=  E  o^hn{kr)Yr{e,<^) 

n=0  7n~—n 

then  the  outer  radiation  function  is  given  by 

n=0  m=-n 

The  radiation  function  is  a  function  defined  on  the  unit  sphere,  and  it  is  convenient  to  use  the  notation 
0(J)  where  s  is  any  unit  vector  (  i.e.  s  lies  on  the  unit  sphere).  For  the  pu^ose  coniputation,  a 
finite  array  of  points,  ai, . . . ,  4  are  selected  and  the  array  of  values  0(si), . . . ,  0(si.)  is  the  radiation 
function.  The  number  and  location  of  the  points  in  the  array  must  be  selected  so  that  it  is  possible 
to  reconstruct  the  multipole  coefficients  0™  from  the  radiation  function.  In  particular  this  means  that 
the  number  of  multipole  coefficients  must  not  exceed  the  number  of  points  in  the  radiation  function. 

fc  >  (P  +  1)^ 

If  the  field  inside  a  region  is  given  by  the  inner  multipole  expansion 

/(r,  e,  <?))  =  E  E  ^nin{kT)Y^{d,<i>) 

n=0  m=-n 
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then  the  inner  radiation  function  is  given  by 

P  n 

%^)  =  E  E 

71=0  m=— n 

To  the  best  of  our  knowledge  the  inner  radiation  function  was  invented  by  Rokhlin  [6]  and  has  no 
obvious  physical  interpretation. 

The  advantage  provided  by  using  the  radiation  function  instead  of  the  multipole  expansion  is  that 
computing  the  inner  radiation  function  from  an  outer  radiation  function  is  quite  simple,  and  has  a 
computational  complexity  of  0(k)  . 

4  Two  Level  FMM 


The  next  step  is  to  translate  the  radiation  function  for  each  cluster  to  the  center  of  its  parent  group 
(see  figure  2).  The  outer  radiation  function  for  each  group  will  require  array  values.  Thus 

the  radiation  functions  will  have  to  be  interpolated  to  a  denser  grid  before  they  can  be  translated  and 
aggregated.  This  interpolation  process  maps  an  array  of  length  into  an  array  of  length 

O^N^/3^  and  requires  0(N)  operations  unless  a  fast  algorithm  can  be  found.  Since 
of  these  interpolations  are  required  an  0(N)  interpolation  algorithm  would  result  in  an 
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complexity  for  this  stage  of  FMM,  making  it  more  complex  than  the  one  level  algorithm.  Clearly  if  an 
overall  complexity  of  is  to  be  achieved,  an  interpolation  algorithm  is  required. 

No  such  algorithm  has  been  published  to  out  knowledge,  and  we  will  not  be  presenting  one  here. 
However,  we  will  present  an  interpolation  algorithm  for  interpolating  from  an  array  of  length  k  to  an 
array  of  length  k  with  complexity  0{k)  log  (^). 

Once  the  outer  radiation  functions  have  been  computed  for  each  group  they  are  used  to  compute 
inner  radiation  functions  for  each  remote  group.  The  complexity  is  per  translation  with 

translations  required  giving  an  overall  complexity  of  for  this  step. 

The  next  step  is  to  take  the  inner  radiation  function  for  each  group  and  to  translate  it  to  the  center  of 
each  cluster  the  group  contains  (see  figure  2).  The  group  inner  radiation  functions  contain 

array  values,  the  cluster  inner  radiation  functions  require  only  array  values.  After  the 

radiation  functions  have  been  translated  it  is  necessary  to  reduce  the  grid  density  from 

back  to  If  this  is  not  done  the  next  step  will  have  a  complexity  that  exceeds 

Reducing  the  grid  density  requires  removing  the  high  harmonics  from  the  radiation  function  and  then 
regriding  on  a  courser  grid.  Removing  the  high  harmonics  is  necessary  before  regriding  to  avoid 
aliasing.  This  process  of  removing  the  high  harmonics  and  regriding  is  called  filtering.  The  filtering 
process  is  a  mapping  from  an  array  of  points  to  an  array  of  points  and  must  be 

accomplished  with  complexity  to  achieve  an  overall  complexity  of  O^N^/^y  No  filtering 

algorithm  has  been  published  to  out  knowledge  either.  We  will  present  an  algorithm  for  filtering  an 
array  of  length  k  to  an  array  of  length  k  with  complexity  0{k\og  [k)). 


The  final  step  is,  for  each  source  point,  to  translate  the  inner  radiation  function  of  its  cluster  to  it’s 
location,  and  then  to  integrate.  There  are  N  source  points,  and  if  the  inner  radiation  functions  contain 
array  values  then  the  complexity  of  this  step  is 

Thus  in  order  to  achieve  an  algorithm  it  is  necessary  to  be  able  to  interpolate  from  an 

array  to  an  array  in  operations,  and  to  filter  an  array  in 

0(N^/^)  operations. 


5  Interpolation  and  Filtering 


Interpolation  can  be  defined  as  follows.  Given  a  radiation  function  of  degree  P  on  a  grid  fi, . . .  ,4 
with  A:  >  (P  +  1)^,  evaluate  the  radiation  function  on  a  new  grid  si, . . . ,  with  k  >  k. 

Filtering  can  be  defined  as  follows.  Given  a  radiation  function  of  degree  P  on  a  grid  with 

A:  >  (P  +  1)^,  remove  all  harmonics  n  >  P  and  then  evaluate  the  radiation  function  on  a  new  grid 
with  A;  >  >  (P  +  l)^.  Either  operation  can  be  accomplished  by  first  computing  the 

required  multipole  expansion  coefficients,  and  then  evaluating  the  radiation  function  on  the  new  grid. 
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Given  a  radiation  function  F,  its  multipole  coefficients  can  be  computed  by 

^  /  Y-^{r)F{T)da 

s 

where  S  is  the  unit  sphere  [8](2.16).  Then  if  we  let  H  =  P  for  interpolation  or  H  =  P  for  filtering 
we  get 


(1) 


^(?)  =  E  E  =  ^  j Fi?)Y,{‘in  +  l)  ^  y--(f)y-(?)cfa- 

71=0  m=-7i  ^  7i=0  m-~n 


jg  77=0 

=  ^  J F{r)Sn{T  ■t)da^ 
s 


where 

^  Pn+i(x)-Pf)(:r) 
a;  —  1 

These  equalities  follow  from  the  addition  theorem  for  spherical  harmonics  and  the  Christoffel-Darboux 
formula  for  Legendre  polynomials. 

Thus  both  interpolation  and  filtering  are  simple  integral  transforms,  and  we  seek  yet  another  fast 
integral  transform. 


6  Fast  Interpolation  and  Filtering 

Fast  algorithms  can  be  developed  for  interpolation  and  filtering  by  exploiting  the  symmetry  of  the 
integral  transform  (1).  The  kernel  of  the  transform  is  (^-5).  If  U  is  any  real  unitary  transformation 
in  3  dimensions  (i.e.  a  rotation  and/or  refection)  then  (Ur)  •  (P?)  =  f -f.  Thus  the  symmetry  of 
rotation  and  reflection  can  be  exploited  to  accelerate  interpolation  and  filtering. 

The  first  step  is  to  select  a  finite  symmetry  group  G  which  is  a  subgroup  of  0(3).  O  is  a  group  of 
3x3  real  orthogonal  matrices.  The  next  step  is  to  replace  (1)  with  a  quadrature  rule  that  is  symmetric 
with  respect  to  G.  If  the  order  of  the  group  G  is  Q,  such  a  rule  has  the  form 

k 

F{r)daf  =  E^i  E 

j=l  U^G 

The  k  *  Q  quadrature  points  are  given  by  f/'fj  where  1  <  i  <  ifc  and  [/  €  O.  The  weights  are  wj. 
The  rule  must  be  of  sufficiently  high  order  to  give  exact  results.  Since  in  (1)  P  has  order  P  and 
Sq  has  order  H  the  rule  must  have  order  P  +  H  <  2P.  In  [9]  McLaren  shows  how  to  develop 
efficient  quadrature  rules  that  are  consistent  with  symmetry  groups  of  rotations  and  reflections  in  three 
dimensions. 


/ 
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When  symmetry  is  being  exploited  it  is  efficient  to  represent  radiation  functions  as  rectangular  arrays 
of  values  with  Q  columns  and  k  rows. 


Fju  =  F{U'r) 

If  we  wish  to  interpolate  or  filter  to  a  new  grid  defined  by  1 1 , . . . ,  %  and  G  then  ( 1 )  reduces  to  (see  [10]) 


47r 


(2) 


®  Ru 

lueG 


[ly  0 1]F 


Y  ®T-'^RuT 

IViG 


[W®T-'^]F 


where 

Ag  =  ^n(li  ■  U’tj 

The  similarity  transformation  T~'^RuT  transforms  each  of  the  regular  representations  Ru  to  block 
diagonal  form,  significantly  reducing  the  complexity. 

This  approach  can  be  applied  to  the  groups  of  rotations  and  reflection  associated  with  the  regular 
polyhedra,  or  to  the  groups  of  rotations  about  the  polar  axis  Cq  or  Cgh  (without  or  with  reflection 
about  the  equator). 


7  Symmetry  Groups  of  the  Regular  Polyhedra 

The  groups  in  this  category  are  the  tetrahedral  group  T  with  order  12,  Ta  with  order  24,  the  octahedral 
group  O  with  order  24,  Oa  with  order  48,  the  icosahedral  group  I  with  order  60,  and  la  with  order 
120  (see  [10]). 

When  equation  (2)  is  implemented  using  one  of  these  groups  the  computational  complexity  is 
O(^'^Q)  =  0f(ibQ)^)/Q.  Note  that  kQ  is  the  number  of  values  in  the  radiation  function,  and  the 
group  order  Q  is  a  constant.  Thus  there  is  no  theoretical  reduction  in  the  computational  complexity 
of  0({#valuesfy  but  there  is  a  significant  reduction  in  the  constant  (by  a  factor  of  120  for  la). 

In  spite  of  the  high  theoretical  complexity,  the  practical  advantage  of  a  very  low  constant  probably 
makes  this  the  method  of  choice  for  interpolating  and  filtering  radiation  functions  with  degree  less 
than  40.  This  implies  scatters  with  diameters  around  6  wave  lengths. 

We  have  not  yet  implemented  this  approach. 

8  The  Groups  Cq  and  Cgh 

These  groups  have  three  advantages  over  the  groups  in  the  previous  section.  First,  groups  of  arbitrarily 
high  order  can  be  achieved  by  selecting  Q  to  be  large.  Quadrature  rules  of  sufficiently  high  order 
can  be  found  with  Q  «  A:,  so  the  computational  complexity  for  filtering  and  interpolation  becomes 
0(A:2q)  =  0^(A:Q)^^^^  =  O  values)^ .  Second,  the  transformation  T  in  equation  (2)  reduces 

to  the  discrete  Fourier  transform  for  Cg  and  Cgh,  and  thus  operations  involving  T  or  can  be 
performed  in  0(Qlog(Q))  operations,  and  furthermore  R-'^RuT  is  diagonal.  Third,  because  Cg  is 
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a  subgroup  of  C2Q  it  is  easy  to  change  groups  as  the  FMM  calculation  proceeds  from  level  to  level, 
making  this  approach  well  suited  for  multilevel  FMM. 

Interpolation  and  filtering  based  on  the  groups  Cq  or  Cgh  provide  a  theoretical  improvement  in 
the  computational  complexity  of  FMM.  Using  this  algorithm  two  level  FMM  has  a  complexity  of 
and  multilevel  FMM  has  a  complexity  of  Thus  multilevel  FMM  and  one 

level  FMM  have  the  same  order  of  complexity.  However,  the  constant  is  significantly  lower  for 
multilevel  FMM. 

This  approach  to  filtering  and  interpolation  has  been  implemented  and  proven  to  work. 

9  Faster  than  Fast 

When  using  the  group  Cq  or  Cgh  the  term  in  equation  (2) 

ueG 

can  be  shown  [11]  to  take  the  form 

'AW 

(3)  ®  = 

U^G 

where 

(4)  A^^^  =  ~ 

Zj  —  Zj 

where  Zi  and  zj  are  the  z-axis  components  of  %  and  rj  respectively. 

The  matrix  is  k  by  k,  but  its  action  can  be  computed  in  0{k)  operations  by  expanding 

1  _ 1 _  1  1 

“  Zj  {zi  -  zq)  -  {zj  -  zo)  ~  Zi-^zo  1  - 

Zi — Zq 

=  ^  ~  Y 

Zi  -  zo  ^  ~zo) 

This  is  just  a  multipole  expansion  in  one  dimension  and  converges  rapidly  if  «  1.  Using  this 
approach  multiplication  by  A^^^  is  reduced  to  0(A:)  and  the  complexity  of  the  entire  interpolation  or 
filtering  process  is  kQ\og{Q)  +  0(/:Q). 

Although  this  algorithm  has  a  superior  complexity,  the  constant  associated  with  the  O(ibQ)  term  is 
sufficiently  high  that  it  will  not  be  effective  until  the  degree  of  the  radiation  functions  exceeds  200, 
or  the  scatter  is  larger  than  30  wavelengths. 

This  approach  reduces  the  computational  complexity  of  two  level  FMM  to  O  log  (N)  j  and 

multilevel  FMM  to  O^Nlog(N)^) 

This  algorithm  is  currently  being  implemented. 


RuT 

A^i) 
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10  Conclusions 

In  order  to  have  a  fast  multilevel  implementation  of  the  fast  multipole  method  one  needs  to  use 
radiation  functions  of  different  orders  at  the  different  levels.  This  requires  an  interpolation  operation 
to  increase  order  going  from  small  source  clusters  to  large  clusters.  A  filtering  operation  is  also 
required  going  from  large  clusters  of  field  points  to  small  clusters.  These  two  operations  are  very 
similar  and  they  can  be  performed  with  a  common  algorithm.  If  a  quadrature  rule  with  a  sufficiently 
high  order  is  used  equation  (2)  is  an  exact  representation  of  that  algorithm. 

Equation  (2)  exhibits  symmetry  and  that  symmetry  can  be  exploited  to  accelerate  the  algorithm. 
Equations  (3)  and  (4)  show  how  the  symmetry  group  Cq  or  Cgh  can  be  used  to  reduce  the  complexity 
of  multilevel  EMM  to  ofNIog  (N)^j.  These  algorithms  are  exact  in  that  they  give  the  same  results 
as  would  be  achieved  by  using  truncated  multipole  expansions  instead  of  radiation  functions. 
Ultimately  we  expect  that  approximate  algorithms  will  emerge  that  are  faster  than  the  exact  algorithms 
discussed  in  this  paper.  When  those  approximate  algorithms  are  proposed  it  will  be  necessary  to  have 
exact  algorithms  such  as  those  presented  here  so  that  the  "degree  of  approximation  of  the  algorithm 
can  be  assessed. 
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ABSTRACT —  The  electromagnetic  (EM)  scattering  from  three-dimensional  (3D)  conduct¬ 
ing  bodies  has  been  studied.  The  electric  field  integral  equation  (EFIE),  magnetic  field  integral 
equation  (MFIE),  and  combined  field  integral  equation  (CFIE)  are  approximated  by  matrix 
equations  using  the  method  of  moments  (MOM).  The  fast  multipole  method  (FMM)  has  been 
implemented  to  speed  up  the  matrix- vector  multiply  in  the  CG  method  when  it  is  used  to 
solve  EFIE,  MFIE,  and  CFIE.  FMM  reduces  the  complexity  of  a  matrix- vector  multiply  from 
O(iV^)  to  With  a  multilevel  fast  multipole  algorithm  (MLFMA),  the  complexity  is 

further  reduced  to  0{NlogN). 

1.  Introduction 

The  EM  field  scattering  by  arbitrarily  shaped  conductor  can  be  obtained  by  finding  the 
solution  of  an  integral  equation  where  the  unknown  function  is  the  induced  current  distribu¬ 
tion.  The  integral  equation  is  discretized  into  a  matrix  equation  by  the  method  of  moments 
(MOM).  The  resultant  matrix  equation  is  then  solved  by  Gaussian  elimination,  which  requires 
0{N^)  floating-point  operations  if  Gaussian  elimination  is  used  to  solve  N  linear  equations, 
or  0{N^)  operations  per  iteration  if  the  conjugate  gradient  (CG)  method  is  used. 

The  fast  multipole  method  (FMM)  [1-6]  speeds  up  the  matrix-vector  multiply  in  the 
conjugate  gradient  (CG)  method  when  it  is  used  to  solve  the  matrix  equation  iteratively.  In 
this  paper,  FMM  is  applied  to  solve  the  electromagnetic  scattering  from  three  dimensional 
arbitrary  shape  conducting  bodies.  The  electric  field  integral  equation  (EFIE),  magnetic  field 
integral  equation  (MFIE),  and  combined  field  integral  equation  (CFIE)  are  considered.  FMM 
formula  for  CFIE  has  been  derived,  which  reduces  the  complexity  of  a  matrix- vector  multiply 
from  O(iV^)  to  (9(A^^'^),  where  N  is  the  number  of  unknowns.  With  a  nonnested  method,  using 
the  ray-propagation  fast  multipole  algorithm  (RPFMA)  [5,6],  the  cost  of  a  FMM  matrix- vector 
multiply  is  reduced  to  0(7V^/^).  We  have  implemented  a  multilevel  fast  multipole  algorithm 
(MLFMA),  whose  complexity  is  further  reduced  to  O(NlogN).  The  FMM  also  requires  less 
memory,  and  hence,  can  solve  a  larger  problem  on  a  small  computer. 

2.  The  Combined  Field  Integral  Equation  (CFIE) 

Practical  electromagnetic  problems  are  often  three-dimensional  and  involve  arbitrary  ge¬ 
ometry.  The  arbitrary  surface  is  described  by  dividing  it  into  a  number  of  connected  patches 
which  are  mathematically  described  as  parametric  quadratic  surfaces  [6,7].  For  conducting 
objects,  the  electric  field  integral  equation  (EFIE)  is  given  by 


t  This  work  was  supported  by  NASA  under  grant  NASA  NAG  2-S71,  Office  of  Naval  Research  under  grant 
N00014-89-J1286,  the  Army  Research  Office  under  contract  DAAL03-91-G-0339,  and  the  National  Science 
Foundation  under  grant  NSF  ECS  92-24466. 
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(1) 


i  ■  j  G(r,  r')  ■  J(r')<iS'  =  —t  ■  E'(r), 
for  r  on  surface  S <  where  f  is  any  unit  tangent  vector  on  *S,  and 

G(r,r')  =  (I-;^VV')<?(r,r'),  (2) 

g{r,r')  =  -^,  R  =  \r-r’\.  (3) 

For  closed  conducting  objects,  the  magnetic  field  integral  equation  (MFIE)  is  given  by 

27i-J(r)  -  n  X  V  X  j  dS'g{r,r')3{r')  =  Attu  x  H‘(r)  (4) 

for  r  approaches  to  5  from  outside,  where  h  is  an  outwardly  directed  normal.  MFIE  (4)  can 
be  written  as 

27rt  ■  J(r)  -i-hxV  X  dS'g{r,  r')J(r')  =  47rf  •  fi  x  H'(r).  (5) 

Then  combined  field  integral  equation  (CFIE)  for  closed  conducting  objects  is  simply  a  linear 

combination  of  EFIE  (1)  and  MFIE  (5)  and  is  of  the  form. 

aEFIE  +  (l-a)^MFIE.  (6) 

The  combination  parameter  a  ranges  from  0  to  1  and  can  be  chosen  to  be  any  value  within 
this  range.  It  is  found  a  =  0.2  to  be  an  overall  good  choice. 

Let  us  apply  the  method  of  moments  (MOM)  to  the  CFIE.  The  unknown  current  J(r)  is 
first  expanded  in  an  appropriately  chosen  set  of  basis  functions  {jj.  The  CFIE  for  J(r)  is 
discretized  by  substituting  the  expansion  in  terms  of  unknowns  a^.  Then,  rather  than  forcing 
the  CFIE  to  be  satisfied  for  r  approaching  S  from  the  outside,  it  is  multiplied  by  a  set  of  N 
testing  functions  {tj}  and  the  inner  products  are  taken: 

=  i  =  (7) 

i=l 

where 

Aji  =  a  j  dSt,(r)  ■  J^dS’Gir,r')  ■  j,(r') 

+  (1  -  a)  J  dStjir)  W  ^  Js 

Finally,  the  integral  equations  are  approximated  by  matrix  equations  (7)  using  the  method 
of  moments  (MOM).  The  specially  designed  basis  functions  for  subdomains  which  contain 
surface  curvature  are  the  generalized  '"Tooftop’'  basis  functions  with  a  constant  charge  density 
and  without  line  charge  accumulation  on  the  surfaces  [7]. 
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3.  The  Fast  Multipole  Method  (FMM) 

Letting  rj  and  Vi  be  the  field  point  and  source  point,  respectively,  we  have 

^ji  ”1“  ^m'  ”  ^jm  ”f"  ^mm'  ^im'-  (9) 

Thus,  using  the  addition  theorem  [2,8]  the  scalar  Green’s  function  can  be  rewritten  as 

pikrji  ih  t  ^  . 

~r~  ~  In  j  '  k),  (10) 

where 

L 

amm'irmm'  *  ^)  =  +  l)h\^\kr^m>)Pl{rmm'  *  ^0’  (H) 

/=0 


We  have  truncated  the  infinite  summation  in  (11).  The  integration  /  in  (10)  represents  the 
integrals  over  the  unit  sphere,  and  will  be  evaluated  by  Gaussian  quadratures  with  K  =  2L^ 
points.  Substituting  the  scalar  Green’s  function  (10)  into  the  dyadic  function  (2)  leads  to 


G(r,,ri)  =  ^J  • 

k). 

(12) 

Substituting  Equations  (10)  and  (12)  into  (8),  we  get  a  formula  to  calculate  the  matrix 
elements  in  non-nearby  group  pairs, 

zk  f  ''  ''  •' 

~  4^  /  ^  fmj{k)  ‘  O'mm'ik  * 

(13) 

where 

yjmiCk)  =  aj  -  kk)  ■  tiirjra)  -{l-a)kx  j 

"ijirjm)  X  h, 

(14) 

(15) 

When  we  use  CG  method  to  solve  the  matrix  equations  (7),  matrix- vector  multiply  is  needed 
to  be  computed  in  each  iteration.  Then,  we  can  write  the  matrix-vector  multiplication  as 


^^AjiUi  ~  X!  X!  d*  f  d  kv Jrnji^)  ‘  X!  ’  ^mm)  X^  •  (1^) 

The  first  term  in  (16)  is  the  contribution  from  nearby  groups  (including  the  self-group),  and 
the  second  term  is  the  far  interaction  calculated  by  FMM.  The  FMM  idea  is  first  to  divide  the 
subscatterers  into  groups.  Then,  addition  theorem  is  used  to  translate  the  scattered  field  of 
different  scattering  centers  within  a  group  into  a  single  center.  Hence,  the  number  of  scattering 
centers  is  reduced.  Similarly,  for  each  group,  the  field  scattered  by  all  the  other  group  centers 
can  be  first  “received”  by  the  group  center,  and  then  “redistributed”  to  the  subscatterers 
belonging  to  the  group.  It  is  proven  that  the  computation  cost  using  (16)  with  2-level  FMM 
is  in  order  [6]. 
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To  implement  a  multilevel  fast  multipole  algorithm  (MLFMA),  the  entire  object  is  first 
enclosed  into  a  large  cube,  which  is  partitioned  into  eight  smaller  cubes.  Each  subcube  is  then 
recursively  subdivided  into  smaller  cubes  until  the  edge  length  of  the  finest  cube  is  about  half 
of  a  wavelength.  Cubes  at  all  levels  are  indexed.  At  the  finest  level,  we  find  which  cube  each 
basis  function  reside  in  by  comparing  the  coordinates  of  the  center  of  the  basis  function  with 
the  center  of  cube  .  We  further  find  nonempty  cubes  by  sorting.  Only  nonempty  cubes  are 
recorded  using  tree-structured  data  at  all  levels  [9].  Thus,  the  computational  cost  depends  on 

onlv  the  nonempty  cubes,  not  all  cubes.  .  ,  ,  fini.  *1, 

"The  basic  algorithm  for  matrix- vector  multiply  is  broken  down  into  two  sweeps  [10).  the 
first  sweep  consists  of  constructing  outer  multipole  expansions  for  each  nonempty  cube  at 
levels  The  second  consists  of  constructing  local  multipole  expansions  contributed  from  the 
well-separated  cubes  at  all  levels.  When  the  cube  becomes  larger  from  the  finest  level  to  the 
coarsest  level,  the  numbers  of  multipole  expansions  should  increase.  In  the  first  sweep,  the 
outer  multipole  expansions  are  computed  at  the  finest  level,  then  the  expansions  for  larger 
cube  are  obtained  using  shifting  and  interpolation.  Let  and  be  the  cube  centers  a 

level  I  and  I  -  1,  respectively,  then  the  outer  multipole  expansions  for  coarser  level  should  be 


But  has  only  Kt  values,  we  need  Ki-i  values  of  Then  we  will  interpolate 

i(fc)  to  Ki-i  values  first,  then 


Vsm;_,i(fc(!-l)n')  ■ 


(18) 


where  interpolation  matrix  TL  is  a  sparse  matrix.  .  1  u 

At  the  coarsest  level,  the  local  multipole  expansions  contributed  from  well-separated  cubes 
are  calculated  using  the  second  part  of  (16).  At  the  second  sw'eep,  the  local  expansions  for 
smaller  cubes  include  the  contributions  from  parent  cube  using  shifting  and  anterpolation  11), 
and  from  the  well-separated  cube  at  this  level  but  not  well-separated  at  the  parent  ^ 

the  local  multipole  expansions  received  by  the  cube  center  al  level  i  -  1  is  B[k),  then  the 
contribution  from  all  well-separated  cubes  can  be  written  as 


■=/d= 


A'i-1 


kVjm^^uCk)  ■  B(fc)  =  Y. 

n'  =  l 


j(fc(i-iK)  ■  B(fc(/_1)„')> 


(19) 


where  w„'  is  the  weighting  function.  Substituting  the  interpolation  expression  for  j(li(i-i)n') 

into  (19),  and  changing  the  order  of  two  summations  lead  to 


E’ 


nV 


{kin)  ■  ^  ^ 


^(!-l)rr 


I  Wn- 


(20) 


The  above  operation  is  called  anterpolation  [11].  At  the  finest  level,  the  contributions  from 
non-well-separated  cube  are  calculated  directly.  Since  only  nonempty  cubes  are  considered, 
the  complexity  for  MLFMA  is  further  reduced  to  0{N\ogN). 
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4.  Results  and  Conclusions 

In  Figure  1,  we  plot  the  normalized  residual  norm  in  CG  method  as  functions  of  the 
number  of  iterations  for  EFIE,  MFIE,  and  CFIE.  It  is  found  that  CFIE  converges  much  faster 
than  EFIE  and  MFIE.  EFIE  and  MFIE  may  not  converge  or  converge  too  slowly  at  specific 
frequencies  corresponding  to  the  eigenvalues  of  the  interior  resonator  problem,  and  give  a 
wrong  solution.  This  phenomenon  is  called  the  internal  resonance  corruption,  because  the 
tangential  components  of  a  single  incident  field  (either  the  electric  or  magnetic  field)  at  the 
scatterer  surface  lack  sufficient  information  to  uniquely  determine  the  desired  surface  currents 
at  a  resonance  frequency.  It  has  been  proven  that  the  CFIE  eliminates  the  effect  of  these 
resonances.  At  all  frequencies,  CFIE  has  an  unique  solution,  and  converges  faster  than  EFIE 
and  MFIE  if  the  matrices  are  solved  by  an  iteration  method  since  the  matrix  for  CFIE  has 
smaller  condition  number  than  those  for  EFIE  and  MFIE. 

Figure  2  shows  the  comparison  of  the  CPU  time  versus  the  number  of  unknowns  for  the 
solution  of  CFIE  with  the  FMM,  standard  CG,  and  LU  decomposition  for  calculating  the 
bistatic  RCS  of  a  conducting  sphere.  It  is  seen  that  the  FMM  outperforms  the  standard 
CG  in  terms  of  matrix  filling  and  matrix  solving.  The  FMM  also  requires  less  memory,  and 
hence,  can  solve  a  larger  problem  on  a  small  computer.  The  simulation  is  performed  on  a 
SUN-SPARC-2  with  64  MB  of  RAM.  The  final  residual  norm  in  CG  method  is  less  than  10”“*. 

Figure  3  shows  the  validation  of  the  numerical  result  from  combined  field  integral  equation 
(CFIE)  with  FMM  against  the  Mie  series  solution  of  the  bistatic  RCS  of  a  metallic  sphere  of 
radius  Im  at  frequency  of  0.72 GHz  for  the  parallel  polarization.  9,408  unknowns  with  2-level 
FMM  are  used.  The  solutions  of  CFIE  with  FMM  agree  with  Mie  series  very  well. 

Figure  4  shows  the  RCS  of  a  one-meter  long  NASA  almond  at  2.5  GHz  in  the  xy  plane 
with  $  =  90°.  Five  unknowns  are  used  per  wavelength.  The  calculation  is  done  by  using  CFIE 
with  LU  decomposition  on  a  SUN- SPARC- 10  with  128  MB  RAM,  and  it  consumes  about  24 
hr  of  CPU  time.  Some  points  computed  using  CFIE  with  2-level  FMM  are  shown.  These 
points  took  0.5  hr/ point  to  compute,  and  much  less  than  3  hr /point  to  compute  with  EFIE 
using  FMM.  The  numerical  results  are  compared  with  experimental  measurement  from  Ohio 
State  University  [12].  There  is  a  good  agreement  between  our  results  and  the  measurement. 

Figure  5  shows  the  bistatic  RCS  of  a  one-meter  long  metallic  square  plate  at  4.5GHz  in 
the  xy  plane  with  incident  angle  0  =  45°.  32,512  unknowns  with  6-level  FMM  are  used.  It 
is  done  by  solving  EFIE  on  a  SUN-SPARC-2  with  64MB  RAM.  The  agreement  between  our 
results  and  the  approximation  by  physical  optics  is  good  when  the  RCS  is  bigger  than  0  dB. 

In  conclusion,  the  fast  multipole  method  (FMM)  has  been  implemented  to  speed  up  the 
matrix-vector  multiply  when  CG  method  is  used  to  solve  EFIE,  MFIE,  and  CFIE.  At  all 
frequencies,  CFIE  has  an  unique  solution,  and  converges  faster  than  EFIE  and  MFIE.  FMM 
approach  reduces  the  complexity  of  a  matrix- vector  multiply  from  O(iY^)  to  With  a 

multilevel  fast  multipole  algorithm  (MLFMA),  the  complexity  is  reduced  to  0{N\ogN).  The 
FMM  also  requires  less  memory,  and  hence,  can  solve  a  larger  problem  on  a  small  computer. 
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r=1m  sphere,  f=0.72GHz,  294  patches,  9408  unkws 


Figure  1.  Comparison  of  the  convergence  of  solutions  of  EFIE,  MFIE,  and 
CFIE  for  a  conducting  sphere  of  radius  1  m  at  0.72  GHz  for  parallel  polarization 
(9,048  unknowns,  7.2  unknowns  per  wavelength). 
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Figure  2.  Comparison  of  the  CPU  time  versus  the  number  of  unknowns  for 
FMM,  standard  CG  and  LU  decomposition  for  a  metallic  sphere. 


r=1m  sphere,  f=0.72GH2,  294  patches,  9408  unkws 


Figure  3.  Validations  of  CFIE  with  FMM  against  the  Mie  series  of  the  bistatic 
RCS  of  a  metallic  sphere  of  radius  Im  at  0.72GHz  for  VV  polarization.  The 
RCS  is  normalized  by  ttg^. 
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1  m  NASA  Almond,  f=2.5GHz,  110  Patches,  3297  Unkws 


^  (Degrees) 

Figure  4.  RCS  of  a  one-meter  long  NASA  almond  at  2.5  GHz  in  the  xy 
plane  with  6  =  90°.  Five  unknowns  are  used  per  wavelength.  The  results  are 
computed  with  LU  decomposition,  and  partially  with  FMM.  The  experimental 
measurement  by  Ohio  State  University  [12]  is  also  given  for  comparison. 


W=15X.  square  plate,  N=32512,  45°  incidence,  6  levels 


6  (Degrees) 

Figure  5.  Bistatic  RCS  of  a  metallic  square  plate  of  length  Im  at  4.5GHz  for 
VV  polarization  with  45°  incident  angle. 
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